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HIGHER ORDER MARKOV RANDOM FIELDS FOR INDEPENDENT SETS 

By David A. Goldberg 
Georgia Institute of Technology 

We consider higher order Markov random fields to study indepen- 
dent sets in regular graphs of large girth (i.e. Bethe lattice, Cayley 
tree). We give sufficient conditions for a second-order homogenous 
CO . isotropic Markov random field to exhibit long-range boundary in- 

dependence (i.e. decay of correlations, unique infinite-volume Gibbs 
measure) , and give both necessary and sufficient conditions when the 
relevant clique potentials of the corresponding Gibbs measure satisfy 
' a log-convexity assumption. We gain further insight into this charac- 

terization by interpreting our model as a multi-dimensional perturba- 
tion of the hardcore model, and (under a convexity assumption) give 
CO ' a simple polyhedral characterization for those perturbations (around 

the well-studied critical activity of the hardcore model) which main- 
tain long-range boundary independence. After identifying several fea- 
tures of this polyhedron, we also characterize (again as a polyhedral 
set) how one can change the occupancy probabilities through such a 
perturbation. We then use linear programming to analyze the proper- 
ties of this set of attainable probabilities, showing that although one 
cannot acheive denser independent sets, it is possible to optimize the 
number of excluded nodes which are adjacent to no included nodes. 



o 

CN 



Oh 



CN , 1. Introduction. Recently, there has been a significant interest in combining ideas from prob- 

^ \ ability, computer science, physics, statistics, and operations research, to shed light on the structure 

and complexity of combinatorial optimization, counting, and sampling problems [83], [72], [1], [103]. 
Some of the most well-studied such problems involve the independent sets (i.s.) of a graph. Consider 
an undirected graph G, which consists of a set of nodes V and edges E, where each edge e £ E is of 
the form (vi,Vj) for some Vi,Vj € V. Then the i.s. of G, X(G), are defined to be the subsets S of V 
with no internal edges; i.e. a set S £ 2 V is an i.s. iff for all pairs of nodes Vi, vj € S, (vi,vj) ^ E. I.s. 
arise in many applications, ranging from the study of communication networks [102] to computer 
vision [16], economics [29], and biology [63]; and the problem of finding the maximum i.s. of a 
graph was one of the original NP-Complete problems identified in Garey and Johnson's classical 
text on computational intractibility [88]. There are a wealth of results about the complexity and 
(in)approximability of counting, sampling, and optimizing i.s. under various restrictions. We make 
no attempt to survey that literature here, instead focusing only on the results most relevant to our 
own investigations. 

1.1. Markov random fields and Gibbs measures. Markov random fields (M.r.f.), and the asso- 
ciated Gibbs measures, have proven to be a powerful framework for studying these (and related) 
questions. Recall that a collection of r.v.s X%, . . . ,X n defines a binary M.r.f. if the following is 
true [8], [79]. First, X.- L has support on {0,1} for all i. Second, to each r.v. X,- L , we can associate a 
subset Ni of the indices {1, . . . ,n}\i (called the neighborhood of Xi), such that for any binary 
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re-dimensional vector x, P( {Xj = Xj}) > implies 

je{i,...,n}V 



F{X t = 1\ f| {X, = x,}) = ¥{X t = 1| P| {X, = x,}). 




In words, the conditional probability that Xi takes a given value if we condition on all other r.v.s is 
the same as the corresponding conditional probability if we only condition on the r.v.s belonging to 
the neighborhood of Xj. We further suppose that the neighborhood relation is symmetric, i.e. i £ Nj 
iff j £ N{. In this case, note that the neighborhood structure of the M.r.f. defines an undirected 
graph, in which there is a node for each variable, and two nodes are adjacent iff they are neighbors. 
Of course, one can view any collection of r.v.s as a trivial M.r.f. by setting iVj = {1, . . . , n} \ i for 
all i. However, many interesting probabilistic models enjoy a non-trivial conditional independence 
structure, e.g. the class of graphical models and their applications to fields ranging from computer 
vision to biology [117]. 

A closely related concept is that of the Gibbs measure. Recall that the cliques of a graph G, 
C(G), are defined to be the subsets S of V with all internal edges present; i.e. a set 5 S 2 V is a 
clique iff for all pairs of distinct nodes Vi,Vj € S, (vi,Vj) € E. For each clique C G C(G), we define 
a non- negative function ttq (called the clique potential), which takes as input binary values for all 
the nodes belonging to C, and outputs a non-negative real number. For a graph G, let \G\ denote 
the number of nodes. For a binary |G| -dimensional vector x and clique C £ C(G), let xp denote 
the projection of the vector x onto the indices of nodes present in the clique C. Then given a graph 
G, a binary r.v. Xi associated to each node Vi € V, and a potential function ire for each clique 
C € C(G), we define the associated binary Gibbs measure P on the r.v.s X\, . . . ,X n as follows. For 
every binary |G| -dimensional vector x, 



There is a deep connection between M.r.f. and Gibbs measures. In particular, under certain regu- 
larity conditions, the joint distribution of X\, . . . , X n is given by a M.r.f. iff the joint distribution of 
X\, . . . , X n is a Gibbs measure, whose underlying graph is defined by the neighborhood structure of 
the given M.r.f. The celebrated Hammersley-Clifford Theorem, as well as several follow-up works, 
make these regularity conditions explicit [58], [56], [8], [54]. 

As M.r.f. and Gibbs measures provide an elegant way to describe a rich family of non-trivial 
correlation structures on collections of r.v.s, the framework of M.r.f. and Gibbs measures has had 
a significant impact on many fields, including statistics, physics, computer vision, geology, biology, 
combinatorial optimization, and operations research [117], [8], [100] [42], [79], [91], [127], [121], [65]. 

1.2. Applications of M.r.f. to physics and combinatorial optimization. We now make the con- 
nection between certain models in physics and combinatorial optimization (i.s. in particular) to 
Gibbs measures (and their associated normalizing constants) more explicit. In his seminal Ph.D. 
thesis [37], Ernst Ising defined the so-called Ising model, to study the properties of particle sys- 
tems which exhibit pairwise attraction and repulsion. Namely, suppose one has a set of particles 
{l,...,n} arranged on an underlying graphical structure G, where each particle i has a "spin" 
<jj G {0, 1}. In the limit, as the spins become fully repulsive / anti-ferromagnetic, the probability 
measure associated with this model (in equilibrium) is a Gibbs measure with support on spin con- 
figurations cr s.t. the set of particles receiving spin 1 corresponds to an i.s. of G. Furthermore, for 
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any i.s. X € 1(G), the probability that the spins are in the configuration corresponding to X equals 

a |j| ( ^'r 1 ' where A 

is a non-negative parameter whose logarithm has an interpretation in 

Sei(G) 

terms of the "temperature" of the system, and |S| denotes the cardinality of S. This system is called 
the hardcore model. Note that to further quantify this probability, one must compute the relevant 
normalizing constant / partition function (p.f.), which equals A' 5 '. When A = 1, computing 

SeX(G) 

the p.f. is equivalent to counting the number of i.s. in G (a #P-Complete problem [121]); as A — > oo, 
all the probability mass gets put on the largest i.s., and computing the p.f. is analagous to finding 
the cardinality of the maximum i.s. 

Such anti-ferromagnetic M.r.f. models have a rich history in the physics literature. Models in 
which G is a lattice were studied in [35], [18], [108], [109], [110], [53], [99], [96], [6]. This work was ex- 
tended to the three-regular tree by L.K. Runnels in [100], and the general regular tree (i.e. Cayley 
tree with general co-ordination number) in [24]. In these early works, it was observed that the 
underlying models underwent a phase transition as one varied A. As this concept will be central to 
the rest of the paper, we now make this more precise. For a fixed graph G and A £ M + , let f\,G 
denote the Gibbs measure, on the i.s. of G, which assigns i.s. I probability A' 1 ^ A' 5 ') . 

Sei(G) 

Let Trf denote the rooted (at r) depth-d tree in which all non-leaf nodes have degree A ; and 
^\,T d ( r £ X) the probability that the root is included in the corresponding random i.s. Then for 

each A > 3, there exists a critical actvity Aa = (A — 1) A_1 (A — 2) _A s.t. for all A S (0, Aa] (i.e. 
uniqueness / long-range boundary independence regime), lim F\ : j> d (r € X) exists; for all A > Aa, 

the limit does not exist. In particular, for A < Aa, there is an asymptotic independence on the 
boundary condition at the base of the tree; for A > Aa, there is a non- vanishing dependence on 
this boundary condition [66]. It is well-known that the existence of this limit can also be phrased 
in terms of whether or not an appropriate infinite tree has a unique translation-invariant Gibbs 
measure [125] , [34] , [17] , [120] , [2] , [1 19] , [55] . 

More recently, it has been shown that this same phase transition also corresponds to the point 
at which certain Markov chains for sampling from the i.s. of a graph of maximum degree A switch 
from mixing in polynomial time to mixing in exponential time [87]. Furthermore, it was shown in 
[121] that for all A < Aa, the problem of computing A' 5 ' admits a Fully Polynomial Time 

Sei(G) 

Approximation Scheme (FPTAS) for all graphs of maximum degree A. Combined with the results 
of [106], [107], [43], which show that no such FPTAS exists for A > Aa unless certain complexity 
classes collapse (i.e. NP = RP), as well as several recent related works [3], [105], [78], [19], [124], this 
shows that the aforementioned phase transition has deep connections to computational complexity. 

We note that questions regarding the existence of such phase transitions also arise in the context 
of various other applications, e.g. fitting correlation structures in computer vision models [111], 
since systems exhibiting long-range boundary dependence may be undesirable from a modeling and 
simulation perspective. 

1.3. Higher order M.r.f.. Note that in all of the aforementioned works using M.r.f. to study 
the i.s. of a graph G, non-trivial clique potentials were given only to cliques of size 1 (individual 
nodes) and 2 (edges), i.e. these were M.r.f. limited to nearest-neighbor (pairwise) interactions. In 
the language of M.r.f., this corresponds to letting the neighborhood Ni of the binary variable Aj 
(set to 1 if node Vi is included in the i.s.) be the set of neighbors of node V{ in G, so that the 
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cliques of the graph associated with the neighborhood system of the M.r.f. are the same as the 
cliques of the underlying graph G (supposing that G is triangle- free) . However, such measures 
are not capturing the full power of M.r.f. In particular, the neighborhood Ni of Xi can be any 
set of indices (not just those corresponding to edges in the underlying graph G), so long as all 
the clique potential functions evaluate to on any configurations which assign the value 1 to two 
nodes which are adjacent in G. For a graph G and node v, let Nd(v) denote the set of all nodes 

at graph-theoretic distance exactly d from v, N<d(v) = N^v) the subgraph induced by those 

k<d 

nodes at graph-theoretic distance at most d, and N>d(v) = Nk(v) the subgraph induced by 

k>d 

those nodes at graph-theoretic distance at least d. We use N(v) as shorthand for N%(v), the set 
of neighbors of v. To be consistent with the literature [111], let us say that a M.r.f. is of the k-th 
order w.r.t. G if the r.v. Xi associated with node of G has neighborhood Ni equal to the set of 
indices of nodes belonging to N<k(vi) \ Uj. The use of such higher order M.r.f. has recently been 
successful in modeling more complicated spatial dependencies in the computer vision community 
[93], [42], [111], [70], [71], [69], [98], [30], [74], [77], [86]. As the set of i.s. of a graph is known to exhibit 
complicated dependencies [23], and (as described previously) first-order binary M.r.f. have known 
limitations (e.g. the aforementioned phase transition) for understanding these dependencies [111], 
we pose the following question: 

Question 1. What is the power of higher order M.r.f. for understanding the i.s. of graphs? 

We will further restrict our analysis to A-regular graphs of large girth (i.e. length of the smallest 
cycle), as such graphs have a simple local description (i.e. A-regular tree), yet can exhibit com- 
plicated global behavior, and thus make an excellent testing ground for studying such questions. 
Furthermore, such graphs are often extremal w.r.t. properties of i.s. [121], and are "typical" of all 
regular graphs in an appropriate sense, i.e. a A-regular graph selected uniformly at random from 
all A-regular graphs looks locally like a A-regular graph of large girth [122]. Note that if G is a 
A-regular graph of girth at least k + 1, and k is even, the maximal cliques (i.e. cliques not properly 
contained in other cliques) of the associated k-th order M.r.f. are exactly sets of the form N < k(vi), 

k 2 
i.e. all the depth-— neighborhoods. Furthermore, these sets are all A-regular trees. 

Let us also suppose that the clique potential functions are homogenous and isotropic (h.L). More 
formally, note that the configuration induced on N < k(vi) by a binary vector x may be viewed as a 

k 2 
rooted (at Vi), depth-—, 0/1 labeled A-regular tree. Then we say that the associated set of clique 

potentials is h.i. if for all i,j (where i may equal j), and any binary |G [-dimensional vectors x, y, 

n N fe (^)( x A r ^ fc (^)) = ^jV. ft fe)(y at, if the rooted 0/1 labeled tree which x induces on N <k (vi) 

is isomorphic to the rooted 0/1 labeled tree which y induces on N < k(vj). In this case, for any fixed 

k and A, the associated M.r.f. has a finite-size description, i.e. an assignment of non-negative real 

k 

numbers to isomorphism classes of depth-— rooted 0/1 labeled A-regular trees. Then we may ask 
the following refinement of Question 1: 

Question 2. What is the power of constant- order h.i. M.r.f. for understanding the i.s. of 
constant- degree regular graphs of large constant girth? 

Note that "constant-order" , "constant-degree" , and "large constant girth" should be interpreted 
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as being fixed (albeit possibly very large) constants, independent of the overall size of the underlying 
graph, which is diverging to infinity. In analogy with the work on first-order M.r.f. for i.s., a closely 
related question is: 

Question 3. What phase transitions are associated with constant- order h.i.M.r.f. defined on 
the i.s. of constant- degree regular graphs of large constant girth, and what kinds of distributions on 
the i.s. can be induced by h.i.M.r.f. in the associated uniqueness regime? 

We note that some special cases of higher order h.i.M.r.f. have already been considered for study- 
ing i.s. and generalizations of the hardcore model. This includes work in the statistical physics com- 
munity which incorporates next-nearest-neighbor and/or competing interactions [39], [75], [115], [50], 
[49], [51], [52], [22], [4]; work which studies so-called kinetically constrained spin models (e.g. the Kob- 
Andersen model) [10], [20], [94], [68], [118], [41] and geometrically constrained spin models (e.g. the 
Biroli-Mezard model) [9], [97], [67], [73], in which a hard density constraint forbids configurations for 
which a particle has more than some fixed number of occupied neighboring sites; and work on mod- 
els assigning positive probability only to maximal i.s., in which every excluded node is incident to 
at least one included node [28] , as well as related combinatorial constraints [84] . Other related work 
comes from the field of stochastic geometry, in which the distribution of points in space depends on 
the local geometry of those points [57], [33], as well as stochastic models arising in communication 
networks [64], [61] and other areas [15], [92]. We note that although a higher order M.r.f. can always 
be represented as a first-order M.r.f. on a modified graph by increasing the associated alphabet 
and adding auxiliary nodes [117], [62], the potentials on this modified graph may not have the same 
natural interpretation in terms of neighborhood configurations. 

Other variants of the hardcore model have also been studied in the literature. This includes 
models associated with more general graph homomorphism [17], the Potts model of physics [123], 
and graph colorings [116], [82] - we make no attempt to survey that literature here. It also includes 
non-binary models, as well as models in which both nodes and edges receive labels [27], [2], [64], and 
various other modifications [114]. Particularly relevant are more general hyperedge models studied 
recently by several authors [89], [60], [36], [76], [101], [33], [112], [44], as the clique potentials associated 
with a higher order M.r.f. can be modeled using an appropriate set of hyperedges. 

1.4. Our contribution. In this paper, we take a step towards answering Questions 2 and 3, by 

analyzing the class of second-order h.i.M.r.f. on the i.s. of constant-degree regular graphs of large 

constant girth (i.e. the Bethe lattice / Cayley graph). Note that if the graph is A-regular, then the 

associated clique potentials are defined by A + 1 parameters, which we denote A, = {Oq, . . . , Oa), 

with A corresponding to the neighborhood configuration in which the central node is included and 

all neighboring nodes are excluded, and 6^ corresponding to the neighborhood configuration in 

which the central node is excluded and exactly k neighbors are included. In particular, an i.s. Z for 

which exactly no excluded nodes are incident to included nodes, m excluded nodes are incident to 

1 included node, . . ., and «a excluded nodes are incident to A included nodes, receives probability 
A 

proportional to A |:r| 0™ J . Note that if O = • • • = #A = 1, 

or more generally there exists some 

j=o 

fixed k, 7 > s.t. 9j = for all j, then our model reduces to the hardcore model. 

Our main results w.r.t. this model are as follows. We give sufficient conditions on X,6 for the 
associated Gibbs measure to exhibit long-range boundary independence, and give both necessary 
and sufficient conditions when the sequence {Oo, ...,6a} is log-convex. We then develop a pertur- 
bative approach to gain further insight into the geometry of this (A + l)-dimensional space (i.e. 
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uniqueness regime). Let 1(0) denote the (appopriately dimensioned) all ones (zeros) vector. Then 
we note that (after normalization) setting = A^ 1 1 reduces our model to the hardcore model at the 
critical activity, and ask for which vectors c is it true that our model exhibits long-range boundary 
independence if we set = A^l + ec for sufficiently small positive e, i.e. how can we bias slightly 
towards (against) certain neighborhood configurations while remaining in the uniqueness regime. 
We give a simple polyhedral characterization for this set of vectors under a convexity assumption on 
c. After identifying several features of this polyhedron, we also characterize (again as a polyhedral 
set) how one can change the occupancy probabilities through such a perturbation. We then use 
linear programming (l.p.) to analyze the properties of this set of attainable probabilities, showing 
that although one cannot acheive denser i.s., it is possible to optimize the number of excluded nodes 
which are adjacent to no included nodes. 

1.5. Outline of paper. The rest of the paper proceeds as follows. In Section 2, we formally 
define all relevant terms and state our main results. In Section 3, we rephrase the probabilities 
and questions of interest in terms of the relevant p.f. and associated recursions, and prove our 
general sufficient condition for long-range boundary independence. In Section 4, we prove our 
necessary and sufficient conditions for long-range boundary independence under a log-convexity 
assumption. In Section 5, we describe the framework for our perturbative analysis, and give a 
simple polyhedral characterization for the set of perturbations which exhibit long-range boundary 
independence. In Section 6, we characterize (again as a polyhedral set) how one can change the 
occupancy probabilities through such a perturbation, and use l.p. to analyze the properties of this 
set of attainable probabilities. In Section 8 we summarize our main results and present ideas for 
future research. We include a technical appendix in Section 9. 

2. Main Results. 

2.1. Preliminary definitions and notations. We now formally define the relevant Gibbs mea- 
sures, which correspond to second-order h.i.M.r.f. As our primary interest will be in regular graphs 
of large girth, we will always have a fixed reference degree A, which will often be implicit. For a 
fixed, strictly positive A € R + ,# = (do, . . . ,9a) £ R + ^ A+1 \ input graph G, node v € V, and i.s. 
I £ 1(G), let 



(!) ^a0g( u ' X ) = 



A if\N(v)\ = A,vel, 

VMrW if\N(v)\=A,v<£l, 
1 otherwise; 



,6>,g( x ) = II 'M.gM; 

and 

Z x,0,G= Yl w \,0,g( X )- 

zez(G) 

Let the Gibbs measure P^ q q denote the probability measure, on 1(G), s.t. for all 1 € 1(G), 



G' 



We now introduce some additional notation, to facilitate describing the above measure under various 
conditionings. For an i.s. I € 1(G) and set of nodes U G 2 V , let Ijj be the inclusion/exclusion 
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pattern (w.r.t. 1) of the nodes belonging to U. We note that when referring to boundary conditions 
(b.c), e.g. conditoning on the event {Ijj = B}, and taking associated infima and suprema, we will 
implicitly restrict ourselves to those b.c. with strictly positive probability. 

Recall that denotes the rooted (at r) depth d tree in which all non-leaf nodes have degree A. 

For an i.s. I of a rooted depth d tree T, let dl = l N>d _ 1 ^ 7 i.e. the b.c. induced on the bottom two 
layers of the tree. Then the main inquiry of this paper is under which conditions 

(2) liminfinfP A )T /r £ 1, |JV(r)p|X| = k dl = 
equals 

(3) limsupsupP A T ( r i X, \N(r)f]l\ = k dl = B J , 

and what inclusion/exclusion probabilities are attainable when these limits indeed coincide, in 

which case we denote the relevant limit as p^' , and the associated vector as p A '^. We also let 
A 

A 9 A \ — ^ A 9 

p + ' = 1 — y p k ' denote the corresponding limit of the probability that the root is included. 

k=0 

To be consistent with the literature [55], [119], we say that the vector (A, 9) admits a unique 
infinite-volume Gibbs measure on the infinite A-regular tree iff for all k € {0, . . . , A}, (2) equals 
(3). As all relevant quantities are unchanged if both A and 9 are normalized by the same positive 

constant, we w.l.o.g. normalize A to equal unity, define 7 = \~ 1 9, and state all results in terms 
of the analogous measure P7,T d , quantities p^, p^, and question of whether 7 admits a unique 
infinite-volume Gibbs measure on the infinite A-regular tree. 

2.2. Main Results. We now state our main results. We begin by giving a sufficient condition for 
uniqueness. For k,n E Z + s.t. n> k, and a vector x € R n , we let Ofc(x) denote the kth elementary 
symmetric polynomial on n variables evaluated at x. Namely, cro( x ) = 1> an d 

k 

^fcW = Yl H x h = S II x - 

l<«l<i2...<ifc<nj=l 5 , g2"t 1 ' , - ,, "J 

\S\=k 

For example, ^(x, y, z) = xy + xz + yz. Also, for x G M + ( A ~ 1 ) ) let 



A-l \ -1 



and 



^7 fc fj fe (x)J . 
k=o ' 



For two vectors x, y of the same dimension, we use the notation x > y to denote component- wise 
domination, i.e. Xj > y, for all i. For a scalar x G M + , we let f(x) = f(xl), g(x) = g(xl), and note 
that ^ 

f ( \ _ Yjk=0 Jk+lj k ) xk 

7 



and 



v fc=0 v 7 7 



Then 

Theorem 1. The system of equations (s.o.e.) 



(4) x = g{y) min / (z); 

z£ R+ (A-l) 
ll<Z<1/l 



(5) y = g(x) max / A x (z); 

z£ R+ (A-l) 
xl<z<j/l 

always has at least one non-negative solution on IR + x R + . // this solution is unique, then 7 admits 
a unique infinite-volume Gibbs measure on the infinite A-regular tree. 

We now provide necessary and sufficient conditions for uniqueness under a log-convexity as- 
sumption on 7. Recall that a strictly positive sequence {xi,i = 0, . . . ,n} is called log-convex iff 

1+1 > — — for all i G {1, ... ,n — 1}, and is called convex iff Xi+i — x% > Xi — Xj_i for all 
i € {1, . . . , n — 1}. Then 

Theorem 2. The s.o.e. 

(6) x = g{y)f /1 -\x); 



(7) y = g(x)f A ~\y); 

always has at least one non-negative solution on M + x M + . A log-convex vector 7 admits a unique 
infinite-volume Gibbs measure on the infinite A-regular tree iff this solution is unique. 

Note that Theorem 2 reduces to the known characterization for uniqueness in the hardcore 
model if we set 7 = A 1. We also note that our proof of Theorem 2 implies that the boundary 
condition in which all nodes on the boundary are included is "extremal" under the stated log- 
convexity assumption; i.e. if there is dependence on the boundary, than that particular boundary 
condition will induce oscillating behavior on the probability that the root is included. Several 
such monotonicity results appear throughout the literature, e.g. for so-called "repulsive" first- 
order M.r.f. [126], and fields which satisfy the well-studied positive (negative) lattice condition 
and related notions of positive (negative) association and the van den Berg-Kesten-Reimer (BKR) 
and Fortuin-Kasteleyn-Ginibre (FKG) inequalities [40], [90], [113], [14], [95], [101], [55], [112]. Although 
we do not pursue those connections here, it is an interesting question to further investigate the 
relation of these results to higher order M.r.f. for i.s. 

As an explicit description of when Equations 6-7 have a unique non-negative solution, and 
which inclusion/exclusion probabilities can be attained in this way, seems difficult in general, we 
develop a perturbative approach to gain insight into the geometry of the uniqueness regime. Recall 
that in the hardcore model (in which 7 = A -1 l), there is uniqueness iff 7 > AT 1. Thus 7 = AT^l 
defines the "boundary of uniqueness" in the hardcore model. We will probe the geometry of the 
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uniqueness regime by explicitly characterizing those perturbations around A A 1 in the parameter 
space which exhibit uniqueness. 

Let us fix a vector c = (c , . . . , ca), and for h G (0, A^ 1 (max |cj|) _1 ), define *y c,h = A^l + he. 

i 

It follows from a simple Taylor series expansion that 

Observation 1. There exists e c > s.t. for h £ (0, e c ), 'y c,h is log-convex iff c is convex. 

Let us say that c is a direction of uniqueness (d.u.) iff there exists e c > s.t. for all h £ (0, e c ), 
7 C '' 1 admits a unique infinite-volume Gibbs measure on the infinite A-regular tree; and a direction 
of non-uniqueness (d.n.u.) iff there exists e c > s.t. for all h £ (0, e c ), 7°'^ does not admit a 
unique infinite-volume Gibbs measure on the infinite A-regular tree. We now provide an explicit 
characterization / dichotomy theorem, classifying (almost) all convex vectors as either d.u. or d.n.u. 

For j E {0, . . . , A}, let Aaj = ( . ^ (A — 2)~ J . Let iv = (ttq, . . . , 7ta) denote the vector s.t. for 

je{0,...,A}, 

TT j £ A AJ ((A - 2) + (6 - 5A)j + 2(A - l)j 2 ). 

Then 

Theorem 3. A convex vector c is a d.u. if n ■ c < 0, and a d.n.u. if tt ■ c > 0. 

In particular, the hyperplane defined by n ■ c = represents a phase transition in the "perturba- 
tion" parameter space. We note that the question of what happens at the boundary (i.e. it ■ c = 0) 
seems to require a finer asymptotic analysis, and we leave this as an open question. As the property 
of convexity can be described using linear inequalities, Theorem 3 implies that the set of convex di- 
rections of uniqueness (c.d.u.) has a simple polyhedral description. In particular, let the polyhedron 
U equal the set of all vectors c = (co, . . . , ca) s.t. 

-c k -i + 2c fc - c k+ i < , k £ {1, . . . , A - 1}; 

TT • c < 0. 

Then 

Corollary 1. U is a full- dimensional cone. A vector c is a c.d.u. if c lies in the interior of 
U, and is not a c.d.u. if c can be separated from U. 

We now study some qualitative features of 7r, to better understand the set of c.d.u. We begin by 
noting that 

Observation 2. For all A > 3, tv > 0, 7ri < 0, 7r 2 < 0, and 7v k > for all k G {3, . . . , A}. 

That 7Ti < 0, 7T2 < 0, and > for all k € {3, . . . , A} is not surprising, since biasing towards 
excluded nodes which are adjacent to few included nodes should tend to reduce alternation, "push- 
ing" one into the uniqueness regime. That the cutoff occurs at exactly k = 2 can be further justified 
by noting that the average number of included neighbors of an excluded node in the hardcore model, 
at the critical activity Aa, is 1 + (A — £ (1,2). The counterintuititve feature of Observation 
2, which seems to violate the above reasoning, is that ttq > 0, i.e. biasing towards excluded nodes 
with no included neighbors (which we subsequently refer to as 0-nodes) leads to non- uniqueness. 
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A heuristic explanation for this surprising phenomenon is that such a biasing encourages all the 
neighbors of each 0-node, which are all at the same parity, to share the same fate (i.e. exclusion), 
thus increasing parity dependence. 

We now give an exact characterization for how moving in a c.d.u. changes the relevant occupancy 
probabilities. Let Sj^ denote the dirac-delta function, namely 6jk = 1 if j = k, and otherwise. Let 

v = (y o, . . . , i/a) denote the vector s.t. for k G {0, . . . , A}, = (AaA^^Aa fc, where we note that 
v represents the set of occupation probabilities associated with the hardcore model at the critical 
activity. Also, let v = (2A(A - 2)) ~ l v. For k G {0, . . . , A}, let p k = (po, . . . , p A ) denote the vector 
s.t. for j £ {0, ... , A}, 

p) = Aaj (*k (4 - (k - 1)A 2 + ((A 2 - 2A + 2)k - A 2 )/) + A- 1 ^, ^ ; 

and p + = (pq , . . . , p A ) denote the vector s.t. for j G {0, . . . , A}, 

p+^(2A 2 )" 1 A Aj (-(A + 2) + Ai). 
Also, let p denote the (A + 1) x (A + 1) matrix whose A;th row equals p k , k G {0, . . . , A}. Then 

Theorem 4. For any c.d.u. c, lim/i (p 7 ' — u) = p ■ c, and lim/i (p i — A - ) = p • c. 

Thus p captures how linear perturbations in the parameter space translate to linear perturbations 
in the space of occupancy probabilities. Note that it follows from Corollary 1 and Theorem 4 that 
the set of perturbations attainable in this manner is a polyhedral set, which may be expressed as an 
appropriate transformation of U. We now go one step further, explicitly carrying out this transfor- 
mation and giving a concise set of linear inequalities. Let us say that a vector p = (po, . . . , pa, P+) 
is a convex perturbation of uniqueness (c.p.u.) iff there exists a c.d.u. c s.t. (po, . . . , pa) = P • c, 
and p + = p + • c. Let the polyhedron T equal the set of all vectors p = (po, . . . , Pa-i) s.t. 

- A A!fc-iPfc-i + 2A aW - A i.+iPfc+i ^ > fc € {1, . . . , A - 2}; 

2(A 4 - 6A 3 + 13A 2 - 12A + 2) 2A 2 - 5A + 6 

E( A + k ^ + J (A-l)(A-2)2 ~ PA - 2 + A - 2 PA ^ ^ ° ; 

A-3 

— A(A - 1)(A - k)(2k + A - 3)p fc < 0. 

fc=o 

Then 

Theorem 5. T is a full- dimensional pointed cone. A vector (po, • • • , PA-i) corresponds to the 
first A components of some c.p.u. if (po, ■ ■ ■ , PA-i) ^ es in the interior ofT, and does not correspond 
to the first A components of any c.p.u. if (po, . . . , PA-i) can be separated from T. Furthermore, 
any c.p.u. p satisfies 

A-l 

PA = -(2A)- 1 Y,( A + k)Pk, 

k=0 

and 

A-l 

P+ = -(2A)- 1 ^(A-fc)p fc . 

k=0 
10 



As T is a pointed cone, it is the conic hull of its extreme rays, which one could in principle 
compute explicitly. Instead of pursuing such an analysis, we identify several important qualitative 
features of the set of c.p.u. In particular, 



Theorem 6. For every c.p.u. p, p + < 0, po > 0, pi > 0, and < 0, k G {2, . . . , A}. 

Note that Theorem 6 implies that moving in a c.d.u. cannot allow one to sample from denser 
i.s., since p+ < 0. Indeed, it is even true that 

Corollary 2. is the unique c.p.u. for which p + = 0. 

That is to say, any non-trivial c.p.u. actually leads to strictly decreasing the density of the 
sampled i.s. The relevant question becomes how one can decrease the density of the sampled i.s. by 
moving along a c.d.u., and what kinds of trade-offs between the various components of the c.p.u. 
are attainable. As all underlying sets are polyhedral, l.p. is the right framework for analyzing these 
trade-offs. In particular, it follows from Corollary 1 and Theorem 4 that 

Observation 3. We may use l.p. to efficiently select the c.d.u. maximizing any linear function 
of the resulting c.p.u., subject to any number of additional linear constraints on the c.d.u. 

We now illustrate with a concrete example. Suppose that our underlying graph represents a 
communication network, in which each node is either broadcasting or not, and adjacent nodes 
cannot simultaneously broadcast. Suppose further that even if a node is not broadcasting, it can 
still partially utilize the signal of any neighboring nodes which are themselves broadcasting. Thus, 
from a quality-of-service (QOS) perspective, it is particularly undesirable for an excluded node to 
have no neighbors which are broadcasting, i.e. 0-nodes are undesirable. In Scenario I, suppose a 
system "controller" has been tasked with slightly reducing the overall density of broadcasting nodes 
by perturbing the underlying -y vector, while minimizing the loss in QOS (increase in po per unit 
decrease in p+). The relevant optimization problem P I is 

min p° • c 

s.t. cell 

p + c = -1 

Alternatively, in Scenario II, we consider a system "hacker" tasked with moving along a c.d.u. which 
maximizes the loss in QOS. The relevant optimization problem P II is 

max p° • c 
s.t. c G U 
p+ • c = -1 

We now characterize the optimal solutions to P I and P II. Let c 1 = (cq, . . . , c^) be the vector 

s.t. c 1 = — — — — 1, and c 11 = (cq 7 , . . . , c A J ) be the vector s.t. cj/ = -A^ 1 (A — k) + tf^oA, k <E 

(A — 2jAa 2 

{0, ...,A}. Then 

Theorem 7. c 1 is an optimal c.d.u. for P I, and acheives a unit increase in po of rf" = 

A + 2 rr a A A 

— — — . c is an optimal c.d.u. for P II, and acheives a unit increase inpn of Tt t = — h 

(A-2)A A V J 11 (A-2)A A 

1. Furthermore, lim (tjj — tj ) = 1 — 2e _1 ~ .264. 

A->oo 
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Interestingly, Theorem 7 implies that to minimize the creation of 0-nodes, one should move 
along a "one-dimensional" c.d.u., which is exactly how one moves away from the critical activity 
in the hardcore model. Alternatively, to maximize the creation of 0-nodes, one should move along 
a non-trivial c.d.u. which cannot be realized in the hardcore model. Note that at the point c 11 , the 
constraint (of U) — co + 2ci — C2 < is not binding. This allows Cq^, which governs the bias towards 
0-nodes, to be larger relative to the other co-ordinates. 

3. Probabilities, partition functions, and proof of Theorem 1. In this section, we 
rephrase the probabilities and questions of interest in terms of the relevant p.f. For i 6 {1, . . . , A}, 
let T d denote the subtree of T d consisting of r, and the subtree rooted at the the ith child of r. 
Also, for a b.c. B on iV>d_i(r) in T^, and i 6 {1, . . . , A}, let B* denote the b.c. which B induces on 
N>d-i( r ) hi T l d . Since {T d ,i = 1, . . . , A} are all isomorphic, let us denote a generic version of this 
depth d rooted tree by T' d . Note that T' d is a modified A-regular tree, with a single extra (root) node 
"spiking out" on top. We denote the root of T' d (formerly r) by vq, and denote the single child of vq 
(formerly a child of r) by v\. For a binary vector x, let |x| denote Xj. For i, j 6 {0, 1}, and a b.c. 

i 

B on N> d ^(v ) in T' d , let Z d (i,j, B) ^ £ w x6t ,(1), and Z d (i, B) ^ f^y^r - Then it 

XeX{T' d ) d[ , , ) 

Xvq = i,Zvi = j 

dX=B 

follows from the basic properties of i.s. that P A q T ^^r ^ I, \N(r) = k\dT = B^ equals 

r£X,\N(r) f]X\=k xe{0,l} A i-l 

dX=B \ x \= k 



E w x,e, Td ( I )+H E w x,e, Td w *i£iW>.b<)+e*< E n^(°> x ;> B ') 

le/(T d ) i=o xei(T d ) i=o x6 {o,i} A i=i 

rel rtX,\N(r)f)X\=i |x|=i 

9X=B g I=B 

which is itself equal to 

v> e n^ xi (°' Bi ) 

xG{0,l} A i=1 
\x\=k 



(8) 



A A 

V- 



nf=i^(i,B0+E^ E n^r xj (°> Bj 



*=o xe{o,i} A i=i 

|x|=i 



which we denote by p fc (B). We also let p (B) denote the vector (p '' (B), . . . , p^' (B)j, and 
A 

p_f' (B) = 1 — Pfc'' d (B) denote the probability that the root is included. We now derive several 
fc=o 

recursions for Z d (i, B), to aid in our analysis. Let Cd(B) = Z d " 1 (0, B). For i, j € {1, . . . , A — 1}, let 
B* J denote the b.c. which B induces on N> d _i(vo) in the subtree of T' d consisting of i>o, ui, the ith 
child of v%, and the jth child of the ith child of v\. Then 
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Lemma 1. For all d> 5 and b.c.s B on N>d-i(vo) in T' d , 

Z d (l,B) = Z,(0,B)/(C d -i(B 1 ),Cd-i(B 2 ),...,Cd-i(B A - 1 )), 

and 

A-i 

Cd(B) = ^Cd-UB 1 ), Cd-i(B 2 ), . . . , Cd-i(B A - 1 )) J] /(Cd- 2 (B^), Cd- 2 (B 1 ' 2 ), . . . , Cd-atB^- 1 )). 

i=l 

Proof. Note that for i € {0, 1}, 

A-i 

Z d (i,0,B)= Yl II ^-i(0, Xi ,B J '), 

xe{o,i} A - 1 i =1 



and 



Thus 



A-i 



Z d (0,l,B) = A JJZ d _i(l,0,B < 



i=l 



A-l 



^d(0,B) 



Similarly, 



xe{o i i} A - 1 i=l 

i=i a n ' 7 xe{o,i} A - 1 »=i 



i=l d-iV ' ; xe{0 ,l}A-i i=l 



Combining the above with the definition of / and g completes the proof. □ 
We now construct bounding sequences for supCd(B) and inf Cd(B). We will then argue that the 

B B 

upper bounding sequence and the lower bounding sequence are monotone, and thus have limits, 
which must satisfy a certain s.o.e. If these equations have a unique fixed point, then these upper 
and lower limits must coincide, providing a sufficient condition for uniqueness (i.e. Theorem 1). We 
note that the idea of using bounding sequences to show uniqueness of Gibbs measures has appeared 
before in the literature (e.g. [27]). 

A 7 A _i, 7i+KA-l , A. -j A 

— " — - 1 max — - * — 
=0,...,A-1 7l 



Let C = 0, C 5 = 7o max l»±)*-\ t ± C, C 6 = Cs- For d > 7, let 

— 7—0 A — 1 *v. — ° — 



±, = 5 (C d -i) min^ / A - 1 (x), 



C d _ 2 i<x<C d - 2 i 



and 



We now prove that 



= <KC d _ x ) max / A ^(x). 
C d _ 2 i<x<C d _ 2 i 
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Lemma 2. For all d > 5, ( < inf ( d (B) < supCd(B) < ( d . 

B b 

Proof. It follows from the monotonicity of g that for all x > 0, 

(9) <?(*) < 7 _1 - 

Also, since 



/(x) 



Efc=o ^r-7fc^fc(x) 
Efc=o7fc^fc(x) 



'T ' l 

/(x) is a convex combination of { — — , i = 0, . . . , A — l}, and for all x > 0, 

Ti 

(10) min < /(x) < max ^±i. 

i=0,...,A-l 7j j=0,...,A-l 7j 

With (9)-(10) in hand, we proceed by induction. The base cases d = 5, 6 follows from non-negativity, 
Lemma 1, and (9)-(10). Now, suppose the induction is true for all k 6 {5, . . . , d— 1}, for some d>7. 
Then it follows from Lemma 1 and the monotonicity of g that £ < inf Q(B), and C d — su PCd(B), 

B b 

completing the proof. □ 
We now show that 

Lemma 3. {C d , d > 5} is monotone increasing, and {C d , d > 5} is monotone decreasing. 

Proof. We proceed by induction, simultaneously on both sequences. The base case, d = 5, 6, 
entails demonstrating that C < C < C 7 > an d C5 > C6 — (7- That C < C < C 7 follows from non- 
negativity. That £ 5 > £ 6 > ( 7 follows from (9)-(10). Now, suppose that £ fc < C k+l i(k — Cfc+i f° r 
all k € {5, . . . , d — 1} for some d > 8. Then by the monotonicity of g, and the fact that minimizing 
over a smaller domain can only yield a larger objective, 

id = 9(Cd-i) mm / A " 1 (x) 

C d _ 2 i<x<C d _ 2 i 

> g(( d _ 2 ) min / A - 1 (x) 
xeR+ ( A _7 X ' 
C d _ 3 i<x<C d _ 3 i 

Similarly, since maximizing over a larger domain can only yield a larger objective, 

Q = g(C d x ) max / A - 1 (x) 
C d _ 2 i<x<C d _ 2 i 

/•A-l, 

(A-l) 



< 5 (C d _ 2 ) max ^ / A - 1 (x) 



C d _ 3 i<x<C d - 3 i 



completing the proof. □ 
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It follows from Lemma 3 that Coo = nm Cd an d C = nm Cj both exist. Furthermore, since 

d— yoo — 00 d— yoo — " 

< < C d < £ 5 f° r au ^ > 5, the continuity of / and g on M + ( A_1 ) imply that 

Observation 4. < ("^ < Coo < °°> an d (C^jCoo) ^ s a solution to the s.o.e. (4) - (5). 

With Observation 4, we now complete the proof of Theorem 1. We actually prove a slightly 
stronger statement (in terms of p.f.), from which Theorem 1 will easily follow. 

Lemma 4. The s.o.e. (4) - (5) always has at least one solution (x*,y*) on M + x R + for which 
x* = y* . If the s.o.e. has a unique solution on R + x M + , then 



x* = C_ = liminf inf Cd(B) = limsupsupCd(B) = Coo = Coo- 

d— >oo B 



= x d-s-oo B 



Proof. We first demonstrate that (4) - (5) always have at least one solution (x*,y*) on M + x . 



A-l 

for which x* = y* . Let f]{x) = x — g(x)_f A-1 (x). Note that r/(0) = — r— < 0. It follows from 

7o 

(9)-(10) that ?/^7o^ 1 (. max ) A X ^ > 0. As r\ is continuous on [0,oo), we conclude that 

there exists z* € K + s.t. f](z*) = 0, which implies that z*l is a solution to the s.o.e. 

Now, suppose (4) - (5) has a unique solution on M + x M + . Then it must be that any solution 
(x,y) to the s.o.e. satisfies x = x* = y* = y. Since (C^jCoo) is such a solution, it follows that 
C^ = Coo m which case the desired implication follows from Lemma 2. □ 

Proof of Theorem I. Let L(z) = ^7 J . \z\ It follows from (8), Lemma 1, and Lemma 4 
that under the assumptions of Theorem 1, for all k € {0, . . . , A}, 

(11) liminf inf p7' d (B) = limsupsupp7' d (B) = = 



7fc ( fc ) C^o 



B d^oo B / (Coo) + MCoo) 

completing the proof. □ 

4. Log-convexity, monotonicity, and proof of Theorem 2. In this section, we show that 
when 7 is log-convex, the function / is monotone, causing the relevant s.o.e. to simplify considerably, 
and the conditions of Theorem 1 to become both necessary and sufficient (yielding Theorem 2). 
We begin by proving that 

Lemma 5. — — /(x) > for all i € {1, . . . , A — 1} and x > iff 7 is log-convex. 

UXj 

df 

Proof. Let us fix some i € {1, . . . , A — 1}. — — (x) will be the same sign as 

CXj 

A-l q A-l A-l q A-l 

(12) 7kCTk ^d^ ^ 1k+i°~k (x) - ^2 7 fc+1 o- fc (x)— 7fe^fe(x). 

fc=0 1 k=0 fc=0 1 k=0 
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Let x* denote the (A — 2)-dimensional vector equivalent to x, but with component i removed. Note 

d d 
that — — co(x) = 0, and for k > 1, — o"fc(x) = crfc_i(x*). Thus (12) equals 

A-l A-l A-l A-l 

E TjO-jW E 7fe+i^fc-i(x J ) - E j j+1 aj(x) ^2 7 fc fj fc _i(x J ) 

j=0 fc=l j=0 k=l 

e'ts e n*)(i>*« e n* 

i=° 5e2{ 1 — A - 1 } 7 v j=o 5g2 {i,...,A-i}v 2 G s 
" |S|=J " \s\=j 

A-l x . A-2 

E%'+i E II x ')(E%+i E II x " 

|S|=J " |S|=J 

which itself equals 

a*) (i>, e n*)(x;V* s 

i=o 5 g2 {i,-,A-i}\i ie5 y v j=o iSg2 {i,...,A-i}\i i £ s 

\S\=j ' \S\=j 

(14) +x/f>, +1 e n*)(x>i+* s n-0 

.7=0 Se2 {i,...,A-i}\i its y K j=o 562 {i 1 ...,A-i}\i ies J 
\S\=j ' \S\=j 



(is) -(eV e n^fE^i e n* 

|s|=j " |s|=i 

as) -/f^ e n*)(x:Vi e n4 



i=o 5e2 {i,...,A-i}\i i £ 5 ' v i=o Sg2 {i,...,A-i}\i ies 

\S\=j \S\=j 



It follows from the definition of fXj(x) that (13) is a polynomial function of xi, . . . , Xj_i, Xj+i, . . . , xa-i, 
with degree at most two in any given variable. Equivalently, there exists a unique set of finite con- 
stants {v Sl ,s 2 ,S 1 ,S 2 e 2{ 1 '- A - 1 >\\ Sif]S 2 = 0} s.t. (13) equals 



e n x j n *?• 

s li 5 2e2 { 1 .- A - 1 }\ 4 jeSi j&s 2 

Sins a =0 



We now compute vs lt s 2 - It follows from the "product of sums" form of (13) that 

(I 7 ) v Sx,S 2 = E 7|,4|7|B|+2- 

AjBg2 {l,...,A-l}\i 

Af]B=S 2 
A|JB=SiUS2 
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For any sets A,B, \A\ + \B\ - \AC)B\ = \A\jB\. Thus if Af)B = 5 2 , and A\J B = Si[jS 2 , it 
follows that 

|A| + |B|-|5 2 | = I^U^I 

= |5i| + 1521-15! f|5 2 | 
= |5i| + |5 2 |, 

since 5i Pj 5 2 = 0. Equivalently, 

\B\ = \S 1 \ + 2\S 2 \-\A\. 

Combining with the fact that for each A G 2^ 1, '"' A ~ 1 ^* s.t. 52 C A, there is exactly one set 
B G 2^ 1 ' - A - 1 ^ i satisfying A f~] B = 5 2 and A\J B = 5i, i.e. £ = 5 2 (J(5 X \A), we find that 

v Si,S 2 = E 7|A|7|5i|+2|5 2 |-|A|+2 

Ag2 {l,...,A-l}\ l 

S 2 CACSi(JS 2 



E U_ igj K' 7 2|&|+|&|-i+2- 
Similarly, (14) equals 

|Si|+|Sa| / ,g , \ 

e e ( _ \so\ pj+rfm\+\si\-j+* Xi n n *?> 



sins 2 =0 



(15) equals 

|Si|+|Sal 



/ 15 I \ 

E E ( a _ L I pH-nim+m-j+x II ^ II x * 



Si,s 2 e2{ 1 '- A - :L >\ < J'=|S2| VJ '" Z|/ ie5i jes 2 



5i n s 2 =0 

and (16) equals 

|5i| + |5 2 



e e L- i^i 9 |)%'+27 2 |s 2 i + i5 1 i- J+ i n ^ n ^ 



Sf 1 ,S 2 €2{ 1 '- A - 1 >\ < i=|S 2 | VJ j€Si je5 2 

5i n s 2 =0 



For 5i,5 2 G 2{ 1 -" A - 1 >\ < and j G [|5 2 |, |5i| + |5 2 |], let 

4i,S 2 0') = 7j72|S 2 |+|Si|-i+2-7i+l72|S 2 |+|S 1 |-j+l ) 

and 

2 c -\ A 

%i,S 2 UJ - 7j+l7 2 |S 2 | + |5 1 |-i+2 - 7j+ 2 7 2 |S 2 | + |5 1 |-i+l- 
It follows that (12) equals 

(«) e e Giiij^^wn^n-j 

Si,S 2 62ti.-A-x}\< J= |5 2 | v 1/17 jeSi je5 2 

SiD5 2 =0 
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\Si\ + \S 2 \ i |0| \ 

p») + e e (/ilk, & o) x, n n x ? 

Note that 4 1>52 (i) = -4 liS2 (2|S 2 | + \S X \ - j + 1) for all j s.t. j G [|S 2 |, |5i| + \S 2 \), and 2|S 2 | + 
|5i| - j + 1 G [|S 2 |, |5i| + |5 2 |], or equivalents j G [|5 2 | + 1, |S"i| + |5 2 |]. Combining with the 
monotonicity (in j) of the function 2|S 2 | + |£i| — j + 1, the fact that the unique solution to the 

\S I + 1 

equation 2|5 2 | + \Sx\ — x + 1 = x is x = \S2\ H (which may be non-integer), and the fact 

I Si I 1 ~ 1 5*1 1 ~)" 1 
that whenever |5 2 | H is an integer one has that z Sl s 2 {\^\ ^ ^ ) = we conclude 

that the coefficient of Xj J^J in (18) equals 

jeS! jeS 2 



|s 2 |+L^i±i_ 



Similarly, note that zfx.SaO') = z Si,S 2 (^l^l + - j) for all j s.t. j G [|5 2 |, + |5 2 |], and 

2|<5 2 | + — j G [|5 2 |,|iS'i| + |5 2 |], or equivalently j G [|5 2 |,|S'i| + [<S 2 |]. Combining with the 

monotonicity (in j) of the function 2|5 2 | + \Si \ — j, the fact that the unique solution to the equation 

I a I 

2|<5 2 | + \Si\ — x = x is x = 1 5^ | H — — (which may be non- integer), and the fact that whenever 

I Si I I S\ I 

— ^— is an integer one has that Zs 1 ,s 2 (\^' 2 \ ^ — W~ ) = ^' we conclude that the coefficient of 

Xj Y[ x j x j i n (19) equals 

je5i jes 2 



|s 2 |+L^J 



a/ 

Combining the above, we find that — — (x) is the same sign as 

dxi 

(22) 



|s 2 |+L^±ij 



e * A (i*d+ e (GI%,)-G-ifti-i)KA w )n^n4 

sins 2 =0 

Note that for all j G [|5 2 |, \S 2 \ + L ^y^ - J], one has that 2|S* 2 | + |5i| - j + 1 > j. Thus the 

definition of 4i,& i m plies that for all j G [|5 2 |, |5 2 | + |_ J], there exists j' > j s.t. Zg lt g 2 (j) = 

'Yjlji+i — Ij+ilj'- ^ then follows from the definition of log-convexity that if 7 is log-convex, 

\S I + 1 

then 4i,5 2 0) — f° r a ^ 3 e [1^2 |j + L 2 ^' Combining with (22) and the well-known 

I _l_ 1 

monotonicity properties of the binomial coefficients, i.e. that for all k < [ J , one has that 

(23) (™)-(™)>0, 



k ) \k-l 
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demonstrates that the log-convexity of 7 implies the desired monotonicity of /. 

Of 

We now show that if 7 is not log-convex, then there exists x > s.t. — — (x) < 0. Indeed, suppose 

dxi 



that there exists some k G [0, A — 2] s.t. k+2 < . Let us fix some set So £ 2^- 1 ''"' A s.t. 
= k. Note that the coefficient of j~J x^ in (22) equals 



Z11 



,5| 



l^l) = 7fc7fc+2 - 7fc+i7fc+i < 0. 



Let us fix some large positive integer M, and let x be the vector s.t. Xj = M 1 for j ^ S^, and 
Xj = M for j € S^. Then for M sufficiently large, it is easily verified that (22) is dominated by the 

single term corresponding to 1 | x? and thus — — (x) < 0, completing the proof. □ 

- LJ - J ax. 

jes* 

With Lemma 5 in hand, we now complete the proof of Theorem 2. We note that our proof of 
necessity also implies that the boundary condition in which all nodes on the boundary are included 
is "extremal" under the stated log-convexity assumption; i.e. if there is dependence on the boundary, 
than that particular boundary condition will induce oscillating behavior on the probability that 
the root is included. 

Proof of Theorem 2. It follows from Lemma 5 that if 7 is log-convex, then the s.o.e. (4) - 
(5) reduces to the s.o.e. (6) - (7), which (combined with Theorem 1) proves the sufficiency of the 
stated conditions. We now show that the conditions are also necessary. Suppose that the s.o.e. (6)- 
(7) has at least two distinct non-negative solutions. Let S denote the set of all 2-vectors (x, y) 

s.t. < x < y < 00, and (x,y) is a solution to the s.o.e. Let y = supz2, i.e. the largest number 

appearing in any solution pair. We first show that y is itself part of some solution pair (i.e. it is not 
just approached). Indeed, consider any sequence of solution vectors {z l ,i > 1} s.t. lim z 2 = y. Since 

i— >oo 

{z\,i > 1} is uniformly bounded, the Bolzano- Weirerstrass Theorem implies that {z l ,i > 1} will 
itself have a subsequence {z lk , k > 1} s.t. {z^ , k > 1} converges, and let us denote this limit by x. 

That (x,y) satisfies the s.o.e. then follows from the continuity of / and g. Similarly, let x = inf z\, 

i.e. the smallest number appearing in any solution pair, and y the other number appearing in the 
corresponding solution pair (whose existence is guaranteed by the same argument used above). 
Note that x<y. 

Let B denote the b.c. with all nodes at depth d included. We now prove (by induction) that 
{Q(B),d > 3} has a non- vanishing parity-dependence, with even values lying below x, and odd 
values lying above y. We begin with the base cases d = 3,4. Note that ^(0,0, B) = #o#a~1' 
Z 3 (0,1,B) = A0£~\ and thus 

C3(S)=7 - 1 (^) A - 1 - 
7a-i 



19 



That Cs(£>) — U then follows from (9)-(10). Similarly, 

z 4 (o,o,B) = Yl o^i'^Kooetzl^- 1 -^ 

= et'ef^ E ^n(7o 1 (^) a - 1 ) w 



^- 1 C 1 1)2 A 5 - 1 (7o 1 (^) A - 1 ) : 

7a-i 

g A-l fl (A-l) 2 



and Z 4 (0, 1, B) = A0f 0X_! ' . We conclude that 

UB) = (^) A - 1 5 (7o 1 (^) A - 1 )- 
7o 7a-i 

< f A -Hx)g{y) = x, 

with the final inequality following from (9) - (10), the fact that y = g(x)f A ~ 1 (y), and the mono- 
tonicity of g. This completes the proof for the base case. 

Now, suppose the induction is true for all d € {3, . . . , 2k} for some k > 2. Then it follows from 
Lemma 1, and the monotonicity of / and g, that 

Q2k+i{B) = 5(C 2fc (S))/ A - 1 (C 2fe -i(^)) 

> g{x)f A -\y) 

> ff(^)/ A-1 (y) = V , since x<x; 

and 

t2k+2(B) = g{t 2k+1 {B))f^- l {t 2k {B)) 

< 3(y)/ A " 1 (s) 

< 9{y)f A ~ l (x) = x , since y > y, 

completing the proof. 

Recall that L(z) = 7i I . ) z 1 . Since {B l ,i = 1, . . . , A — 1} are identical b.c, (8) and Lemma 

i=o ^ % ' 
1 imply that for all d > 5, 



It follows from the monotonicity of L and / that for all even d > 5, 

L{x) ,-i 



while for all odd d > 5, 



D 7 ' d fS) > (1 + M ~ j 
P + ^>(1+ /A( _ )y , 



As 

, L(x) s -i , i(y) 

1 + ^7 > 1+ ■ / ' 



/ A (y) y v f A {x)> ' 

it follows that the conditions of the theorem are also necessary, completing the proof. □ 
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5. A perturbative analysis, and proof of Theorem 3. In this section, we develop a 
perturbative approach to gain insight into the geometry of the uniqueness regime, proving Theorem 
3. First, it will be useful to rewrite the s.o.e. (6) - (7). Note that g is a strictly decreasing function on 

K + , and thus has a well-defined inverse g^~ with domain (O,-/^ 1 ]. Let p(x) = xf~^~ 1 \x), q(x) = 

g^(p(x)^j, and r(x) = q(q(x)) — x. Then 

Observation 5. The s.o.e. (6) - (7) is equivalent to the s.o.e. 

(24) r{x) = 0, 



(25) y = q(x). 

Let xa = (A — 2) _1 . We note that in the hardcore model, at the critical activity Aa ; the limit 
Coo exists and equals xa- We now complete the proof of Theorem 3, and in fact prove a stronger 
result, which shows exactly how the solution to the s.o.e. (24) - (25) changes (as a perturbation 
around xa) as one considers the perturbed vectors ~/ c,h . For a fixed c and h, let S c ' h denote the set 
of all solution pairs (x,y) to the s.o.e. (24) - (25) over M + x M + . If S c,h contains a unique solution 
(i.e. c is a c.d.u.), which (by Lemma 4) must be of the form xl for some x G M + , we denote this x 
by x Cjh . Let 

A 

x c = (2A(A - 2) 2 )" 1 A A , fc ((A 2 - 2A + 2)k - A 2 )c k . 

k=0 

Then 



Theorem 8. A convex vector c is a d.u. if n c < 0, in which case lim h 1 (x c ^—xa) = x c ; and is 

h\0 



a d.n.u. if 7r-c > 0, in which case for all sufficiently small strictly positive h, there exist (x^ 



c,h c,h 



S c ' h s.t. lim(x^ - XA)h- 
h\a 1 



6(A 



A(A-2) 3 



1) 

7T ■ C 



2 , limfxn' 



C./i 



XA)h 



6(A- 1) 
A(A-2) 3 



, x 2 
i 



7T • C 



Note that when a convex direction lies in the uniqueness regime, the corresponding limit x c ^ 
is perturbed from xa by a magnitude linear in h; while in the non-uniqueness regime, there are 
solutions perturbed from xa by a magnitude non-linear in h. Before proving Theorem 3, we will 
derive Taylor expansions for r and various related functions. As these will often be joint expansions 
in both h and x, we will make the dependence on h explicit using a subscript notation, e.g. r/j(x). 
When discussing the properties of such a function in a neighborhood of a value ho for h and xq for x, 

we use the notation (ho, xq) for the point of reference. For I £ {0, 1}, let fi(x) = jk+i f 



so g(x) = / 1 (x), and f(x) = fi(x)g(x). For I € {0, 1} and i, j G Z + , let 
' (A _ 2)J +1 /A - 1 



k=0 



k 



(A - iy 

k=j v 
o 



(A -2) 



-(k-j) 



i = Q,j G {0,...,A-1}; 

Ck+l i = l,j G {0, . . . , A - 1}; 
otherwise. 
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Let 

«o = (A - 2)- 2 ((A - l)c /l)lj0 - Ac /0) i j0 ), 
ki = (A - 2)- 2 ((A - l) c/l ,i,i - (A - 2)c /oil)1 ), 
and r' h (x) = d x rh(x). Then 

Theorem 9. qh{x), rf l (x), andr' h (x) are jointly real analytic functions ofh,x at (0,xa), where 
they have the Taylor expansions 

oo oo oo oo oo oo 

<lh{x) = ^2^2cq :i:j h t (x-x A y , r h (x) = ^2^2cr ti jh t (x-x A y , r' h (x) =^2^2c r/ ^jh l (x-x A y. 
i=0 j=0 i=0 j=0 i=0 j=0 

Furthermore, 

c q,0,0 = %A , c q,0,l = ~ L 

A(A-2) 3 

C r ,0,0 — , C rj o,l — , C r fi,2 — , C rj o,3 — g(A _ 1)2 ' C ' r ' 1 ' ~~ 

A-2 A 1 

Cr,l,l — — ^ _ j- c /o,l,0 + ^ _ 2 c /i,l,0 ~ 2k 1 7 C ri2 ,0 — -^^l.l^O, 

and 

<V,0,0 = , C r ' ,0,1 = , Cy^o = 

Proof. The proof is deferred to the appendix. □ 

We will also need a certain continuity result, which ensures that for sufficiently small h, one can 
restrict the search for solutions to the s.o.e. (24) - (25) to a small neighborhood of x A . In particular, 

Lemma 6. For all e > 0, there exists 5 > s.t. if h £ [0,5], r^(x) = implies \x — x A \ < e. 

Proof. The proof is deferred to the appendix. □ 

With Theorem 9 and Lemma 6 in hand, we now complete the proof of Theorem 8, which imme- 
diately implies Theorem 3. 

Proof of Theorem 8 and Theorem 3. Observe that 

A-l / a -i \ a A-l 



fe=0 V 7 k=0 V 7 



(A-2) 2 ^V k K ' ™'A-2^V k J 

v ' k=l v 7 k=l v 7 

A. / A-2, ft, , 2(A-1),„ x . fc,\ 

fc=o ^ 7 
-(A-LpV-c. 
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First, suppose it ■ c < 0, and thus > 0. Let x* h be any solution to (24), where we have made 
the dependence on h explicit. We begin by proving that there exists e > 0, depending only on c, 
s.t. 



(26) 



h G [0, e] implies \x* h — xa\ < (1 + |«o[)^- 



Indeed, it follows from Theorem 9, Lemma 6, and the multivariate Taylor's theorem that for any 
fixed 5 > 0, there exists eg G (0, oo), depending only on 8 and c, s.t. h G [0, e^] implies 



(27) 



A(A - 2) 3 , „ 3 , 1 2 

6(A - l) 2 ^ Xh ~ XA ' Cr ^ x h ~ x ^) h ~ 2 Cr ^' lK ° h 



< 8{h\x* h — xa| + h 2 + \x* h — xa| 3 ) ■ 



A(A - 2) 3 1 

Let 5 = min ( — — r=-, -0^1,1). Suppose for contradiction that for some h G [0, ej], \x* h — x&\ > 

1 2. { _A 1) 8 

(1 + |Ko|)/i. Then h 2 < (1 + |K |) _1 |xj!j — x&\h, and since (x* h — xa) 3 is the same sign as x* h — xa, it 
follows that the l.h.s of (27) is at least 



A(A-2) S 



6(A - l) 2 lXh 

However, the r.h.s. of (27) is at most 

5\x* h — xa| 3 + 25\x* h — x/\\h, 

and the desired result (26) then follows from the fact that our choice of S leads to a contradiction. 

We now prove that if c r ^^\ > 0, then there exists e G (0, oo), depending only on c, s.t. h £ [0, e] 
implies that the s.o.e. (24) - (25) has a unique solution x c /j. It follows from Theorem 9 and the 
multivariate Taylor's theorem that for any fixed 5 > 0, there exists eg G (0, oo), depending only on 
5 and c, s.t. h G [0, eg], x G [xa — (1 + |^o|)^> xa + (1 + |^o|)^] implies 



(28) 



|r^(x) — cv^i/i < 5h. 



Taking 5 = -c rt i,i, we conclude that there exists e G (0, oo), depending only on c, s.t. 

h G [0, e] implies r' h (x) > for x G [x/\ — (1 + |«ol)^i X A + (1 + l^ol)^]- 

Combining with (26) completes the proof. 

We now complete the proof of the first part of the theorem. Combining (26) and (27), we find 
that for any fixed 5 > 0, there exists eg G (0, oo), depending only on 5 and c, s.t. h G [0, e^] implies 



1 



c r ,i,i(x c ,h ~ XA)h - -c T) i^K h 



A(A 



1 + M)V < 5(2 + \K \)h 2 + 5(1 + \K \Yh 3 , 



6(A- l) 2 



and thus 
(29) 

Letting 5, h — > 0, it follows that 



lim/l 1 (x c h — Xa ) = — Kn. 
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Noting that 
Kq 



%A c k+l 



A-l 

A E 

fc=0 



A-l 



x A c k 



^-^((A-DE^- 1 ) 

v fc=0 v 7 

= (A(A-2) 2 )^ W^VA-2)- fc ((A 2 -2A + 2)fc-A 2 )c fc = 2x c 
completes the proof. 

Alternatively, suppose it ■ c > 0, and thus Cj-n < 0. It follows from Theorem 9 and the multi- 
variate Taylor's theorem that for any fixed 5 > 0, there exists eg G (0, oo), depending only on 8 and 
c, s.t. h G [0, e$], x G [xa — e§, %A + £s] implies 

/A(A-2)3 

-{x - xa) + c ri i 5 i(x - XA)n 



(30) 



Let a 



U(A-l) 5 



< <5(/i|x — xa| + \x — xa\ 3 + h?). 



A 6(A-1) J 



^ ^ .^c r> i ; i, and e G (0, a) be arbitrary. Let us fix a third strictly positive constant 

5(a,e) (to be defined), depending only on e and c, and apply (30) to conclude that there exists 
<x5(a,e) ^ (0>oo), depending only on 5(a,e) and c, s.t. h G [0, Ss(a,e)] implies 

/A(A-2) 3 



r h{%A + (e + «) 2 /i 2 ) > ( 



6(A- l) 2 

i /A(A-2) 3 



(e + a) 



(e + a) 2 +^1,1(6 + 0)2^/12 -<5(a,e)( (e + a) 2 + (e + a) 2 + 1 
e — <5(a, e)( l + e + a + (e + a 



6(A- l) 2 



/ii. 



Picking <5(a,e) sufficiently small thus ensures that t7j(xa + (€ + 0)2/12) > for h G [0,?j( tte )]. 

Repeating the same argument, but with xa + (— e + 0)2/12, we conclude that under the same 
assumptions r^ixA + (— e + a) 2/12) i s at most 

- 1) 2 + Q ^ + Cp » 1 ' 1 ^ + ^ + ^ a ' £ ) ((~ e + «)^ + (~ e + «)^ + ^ 



(-e + a 



1 A(A-2) 



6(A-1) 2 



e + <5(a, e) I 1 — e + a + (— e + a)" 



and picking <5(a, e) sufficiently small thus ensures that ^(xa + (— e + a) 2 /i 2 ) < for /i G [0, ^s(a,e)]- 
We conclude that for all e G (0, a), there exists <5(e,c) G (0, 00), depending only on e and c, s.t. 

h G [0, 5(e, c)] implies that Eq. (24) has a solution lying in [xa + (— e + a) 5 /12 , xa + (e + 0)2/12]. 
Combining the above with Theorem 9 (i.e. c g> o,i = ~~ 1)> the fact that y = <?/i(x), and the multivariate 



Taylor's Theorem, and taking the iterated limit lim lim, completes the proof. 

e->0/i->0 



□ 



We now complete the proof of Corollary 1. We first formally define the constraint matrix of the 
polyhedron U. In particular, for k G {1, . . . , A — 1}, let u k = (u , . . . , u A ) denote the vector s.t. 



u 



k A 



-1 j = k - 1; 

2 j = fc; 

-1 j = fc + l; 

otherwise. 
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Also, let u A = 7r, and U denote the A x (A + 1) matrix whose kth. row equals u fc , k £ {1, . . . , A}, 
i.e. IA defines the constraint matrix of U. Before completing the proof, we also compute several 
relevant summations, as they will be useful in later proofs as well. 

Lemma 7. 

, a -^a A. A ^A-l)^ 1 
£ A ^ = (a-2 } < S» A ^ = A ( A -2)A < 

E^ = 2A^1^ , ^3 Aa . = a(5A _ 6) (^1^!. 
Proof. The proof is deferred to the appendix. □ 



,A-1. A _, A . . A-l, 



Corollary 3. 

E^ = -(a^) A_1 < E^ = A (a^) A-1 > EA fc = (2A 2 r 1 A A , fe (A + 2-Afc) , 

i=0 i=0 i=0 

X:P^ = (2Ar 1 (fcA-(A+2))A A , fc , ^p+ = -(2A 2 )- 1 ( A ^) A - 1 , £p^= (aA)" 1 ^)*" 1 

i=0 i=Q i=0 

A A 

Y,P k r =A A , i (2A 2 )- 1 (A + 2-Ai) , ^A; A fc = A A , i (2A)- 1 (A*-(A + 2)). 

fe=0 fc=0 

Proof. The proof is deferred to the appendix. □ 

Proof of Corollary 1. The second part of the corollary follows immediately from Theorem 
3, and U is trivially a cone. All that remains to be shown is that U is full-dimensional. It suffices 
to demonstrate that 

(31) rank{U) = A, 

since in that case the system of inequalities (s.o.i.) IA ■ x < has no implied equalities. It follows 
from a straightforward induction that {u fc , k = 1, . . . , A — 1} are linearly independent, since for 
k € {1, . . . , A — 1}, u^ +1 / 0, while u 3 k+1 = for all j < k — 1. Thus it suffices to demonstrate that 

u A is not a linear combination of {u fc , k = 1, . . . , A — 1}. Suppose, for contradiction, that there 

A-l 



exist y = (yi, . . • ,y A -i) ^ s.t. u A = 7T = y fc u fc . Then 



fc=i 

-yi = tto; 

2yi -y-2 = 7Ti; 
-yjfe-i + 2yfc - yfc+i = TTfe, fe G {2, . . . , A - 1}; 
-y A -2 + 2y A _i = 7r A _i; 
-y A -i = tt A - 
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fc-1 



It follows from a straightforward induction that y& = — — for all k S {1, . . . , A — 1}. 

A-2 

We conclude that yA-i = — ^^(^ — 1 — Combining with the fact that — yA-i = ""A, 

j=0 

A 

we find that ^](A — 1 — = 0. Combining with Corollary 3 yields a contradiction, since 

3=0 
A A 

(A — 1) ^^7Tj ^ ^]i7rji completing the proof. □ 

j=o j=o 

6. Proof of Theorems 4 and 5. In this section, we complete the proofs of Theorems 4 and 
5, characterizing how moving in a c.d.u. changes the relevant occupancy probabilities (i.e. c.p.u). 

Proof of Theorem 4. We use the notation o(h) to denote a function s.t. limh~ 1 o(h) = 0. 

h\.0 

Let Xc,h = h~ 1 (x Cy h — x A ). Then it follows from Theorem 8 and (11) that for all sufficiently small 
non-negative h, 

(32) = 7f (*) < h (/ A (*c*) + £ 7- ' h (t) <n) 

As Xc^l is a solution to the s.o.e. (6) - (7), it follows that x c ^ = 5'(x Cj / l )/ A ~ 1 (x Ci / l ), and thus 
f A (x c ,h) = x^ h f{x c ^ h )g~ 1 (x^ h ) 

i=o v y i=0 



Combining with (32), we find that p fc equals 



i=0 

which itself equals 

-l 



AA,fc(A A 1 + c fc /t)(l + x A 1 Xc ^ h h) k ( ^(1 + A- 1 i)(A A 1 + c;/i)A A ,i(l + x^Xc^^M 

^ i=0 ' 

( A 

= A A ,fc(A A 1 + c fc /i)(l + kx^Xc,hh + o(h)) ( ^A A ,j(l + A ^)(A A X + c,-/i)(l + ix^x^h + o(fc)) 

^ i=Q 

= A Ajfc A A 1 (\ + (A A c fc + (A - 2)k Xc ,h)h + o(h)j 

f A , A xA \ ~ 

A A X 2 A A,i(l + A- X i) + (A - 2)A A * ( E A A) i(l + A" 1 *)* J XC)A fc + £ W 1 + A~ 1 i)c i /i + o(/i) 

v i=0 ^ i=0 ' 1=0 / 
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It follows from Lemma 7 and a straightforward calculation that A^ 1 Aa,»(1 + A 1 z) = A , and 
(A — 2)A^ 1 ( ^Aa,j(1 + A _1 z)i | = A 2 — 4. Combining the above and simplifying, we conclude 

\ i— n / 



that 



As Aa&A7 A - = Uk, and it follows from Theorem 8 that limx c ,/i = x c , we conclude that 

/14.0 



lim/t~ (p, — i/jfc) equals 
^4-0 



f fe ^(A - 2)fc - A- X (A 2 - 4)) ((2A(A - 2) 2 )' 1 £j A A ,,((A 2 - 2A + 2)i - A 2 )c^ 
-A" 1 A A,i(l + A- x i) Ci + Aac^ , 

i=0 J 

which (after simplifying) completes the proof of the first part of the theorem. 

A 

We now prove the second part of the theorem. Since p+ = 1 — y. P// > an d it follows from 

fc=0 

A 

Lemma 7 and a straightforward calculation that 1 — Y2 Uk = A_1 > the first part of the theorem 

fc=o 

implies 



A 

C 



lim/r^pT' - A" 1 ) = -Vff 

fc=0 



"EE A <V (** ( 4 - ( fc - !) A ' + (( A ' " 2A + 2 ) fc " A2 )*) + A ^kk) c 

fc=0 8=0 V ^ ' J 

- a a,, ( J> fc (4 - (fc - 1)A 2 + ((A 2 - 2A + 2)fc - A 2 ) A + A" 1 ^ kk 



i=0 \fc=0 v y k=0 

The second part of the theorem then follows from Lemma 7 and a straightforward calculation, 
completing the proof. □ 

We now complete the proof of Theorem 5, which gives an explicit polyhedral description of the 
set of c.p.u. Recall that U denotes the constraint matrix for the polyhedron U (which defines the set 
of c.d.u.), whose kth row equals u k , k € {1, . . . , A}. We now similarly define the constraint matrix 
of the polyhedron T. It will be convenient to scale certain rows by positive constants, which does 
not effect the underlying cone. In particular, for k 6 {1, . . . , A — 2}, let t k = (tg, . . . , t^-i) denote 
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the vector s.t. 





3 


= k- 1 




3 


= fc; 




3 


= fc+1 



otherwise. 



Also, let t 



A-l 



A-l 



>*A-i) denote the vector s.t. 



..A-l A 



j G {0,...,A-3}; 



i(A-2) A (A + j) 
f A - 2) A ~ 2 

(A 4 - 6A 3 + 13A 2 - 12A + 2) j = A - 2; 



1 



A-l 



(A 



vA-l 



(2A 2 - 5A + 6) 



j = A - 1; 



t A = (t A , . . . , t^-i) denote the vector s.t. 

tf i |— A(A - 1)(A - i)(2j + A - 3) j € {0, . . . , A - 1}; 

and T denote the A x A matrix whose fcth row equals t fc , k £ {1,...,A}, i.e. T defines the 
constraint matrix of T. It will also be convenient to introduce a "truncated version" of the matrix 
p. In particular, let p denote the A x (A + 1) matrix consisting of the first A rows of p. 

We begin by proving several lemmas. 
Lemma 8. T -p = U. 

Proof. The proof is deferred to the appendix. 



□ 



Lemma 9. 



> A = -^f> + *)p fc - 



fc=0 



Proof. The proof is deferred to the appendix. 

With Lemmas 8 - 9 in hand, we now complete the proof of Theorem 5. 



□ 



Proof of Theorem 5. It follows from elementary linear algebra that for any two matrices 
A,B, rank(A ■ B) < min (rank(A), rank(B)) . Combining with Lemma 8 and (31), we conclude 
that 



(33) rank(T) = A , rank(p) = A, 

and thus T is a full-dimensional pointed cone (in analogy with the proof of Corollary 1). 

Since p is a A x (A + 1) matrix with rank A, there must exist some vector z £ M A+1 s.t. the set 
of vectors {p k , k = 0, . . . , A — 1} [^J z is linearly independent. Let us fix such a vector z, and let p 
denote the (A + 1) x (A + 1) matrix whose first A rows comprise the matrix p, and whose (A + l)st 
row is z. Since rank(p) = A + 1, the matrix p has a well-defined inverse p~ . Let T denote the 
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Ax (A + 1) matrix whose first A columns comprise the matrix T, and whose (A + l)st column is 

0. Then it follows from Lemma 8 that 1~ ■ p = U, and therefore 1~ = U ■ p~ . 

Now, suppose that some vector p = (po, . . . , PA-i) belongs to the interior of the polyhedron T, 

1. e. T • p < 0. Let p = (p, 0), and c = p 1 ■ p. In that case, 

U-c = Up l \> 
= f p. 

Since T ■ p < implies T • p < 0, it follows that U ■ c < 0, and thus c is a c.d.u. by Corollary 1. 
Furthermore, since c = p _1 -p implies that p = pc, we conclude that p^ = p k c, k € {0, . . . , A — 1}. 
It follows that p are the first A components of some c.p.u. 

Alternatively, suppose that p are the first A components of some c.p.u., i.e. there exists a c.d.u. 
c s.t. p = p ■ c. In that case, 

T ■ p = T pc 

= U-c < 0, 

where the final inequality follows from the fact that c is a c.d.u. It follows that p cannot be separated 

from T. Combining the above completes the proof of the first part of the theorem. 

A-l 

That any c.p.u. p satisfies pa = — (2A) _1 ^^(A + A;)p/t follows from Theorem 4 and Lemma 9. 

k=0 

A-i A 

That p + = — (2A) _1 ^^(A — k)-pk then follows from the fact that p+ = — ^^Pfc- Combining the 

fc=o fc=o 
above completes the proof. □ 

7. Proof of Theorems 6 and 7. In this section, we use Observation 3 to complete the proofs 
of Theorem 6, Corollary 2, and Theorem 7, by analyzing an appropriate set of l.p.s. 

PROOF of Theorem 6. For a vector z = (z , . . . , za) which is not all-zeros, let P x denote the 
l.p. 

max z • c 
s.t. c € U 



The corresponding dual l.p., which we denote by D z , is 



min 

S.t. — di + dA7T = z 

2di - d 2 + dA7Ti = zi 
-dj-i + 2dj - dj+i + d A 7Vj =zj , j G {2, . . . , A - 2} 

-dA-2 + 2dA-i + dA7TA-l = ZA-i {I) 
-dA-i + dA7TA = za {II) 
d > 
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Recall that Corollary 1 implies U is full-dimensional, and thus it cannot be the case that z • c = 
for all c G U. Combining with the fact that T is a cone, we conclude that at least one of P z , P~ z 
must be unbounded. Furthermore, since T is pointed and thus does not contain a line, it follows 
that exactly one of P z , P~ z is unbounded, while the other (bounded) l.p. must have optimal value 
0. Applying strong duality, we conclude that D z is infeasible iff P _z has optimal value 0. We now 
prove that is infeasible for k G {0,1}, D~P is infeasible for k 6 {2,..., A}, and D~P is 
infeasible, implying the desired theorem. 

It follows from a straightforward induction that any feasible solution (f.s.) d = (di, . . . , d A ) for 
D z (should such a f.s. exist) must satisfy 

(34) d j = lj2U-i)[^i~<) JG{1,..-,A-1}. 

I i=0 

But by adding Equalities (I) and (II), we find that any f.s. d must also satisfy 

dA(7TA-l + 7Ta) - (za-1 + Za) + (dA-1 - dA-2) = 0. 
A-2 

Since (34) implies dA-i — dA-2 = (^A^i — Zj), we may combine the above to conclude that 

i=0 

A A - A-2 _ A 

cIa '^2 7V i — ~^2 z i = 0- Equivalently, after applying Corollary 3, dA = — — -) A 1 ^^ z j. Com- 

i=o i=0 i=0 

bining with the fact that the non-negativity constraints require that dA > 0, we conclude that 
A A 

if ^^z.j > 0, then D z is infeasible. It follows from Corollary 3 that ^^pf > for k G {0,1}, 

i=0 i=0 
A A 

p\ < for k G {2, . . . , A}, and pf < 0. Combining the above completes the proof. □ 

i=0 i=0 

Before completing the proofs of Corollary 2 and Theorem 7, it will be useful to introduce some 
further notation and l.p.s. For vectors z 1 = (zq, . . . , z A ), z 2 = (zq, . . . , z A ), and scalar y, let P z ' z ,y 
denote the l.p. 

max z 1 • c 
s.t. c G U 
z 2 • c = y 

1 2 

The corresponding dual l.p., which we denote by D z ,z ,y , is 

min dA+iy 

S.t. — dl + dA7T + d A +lZo = Zq (I) 

2di - d 2 + d A 7Ti + d A +iz 2 = z\ (II) 
-dj-i + 2dj - d j+1 + dATTj + d A +iz 2 = z) , j G {2, . . . , A - 2} (A) 
-dA-2 + 2d A -i + d A 7r A -i + dA+iz A _ x = z A _ x (III) 
-d A _i + d A 7r A + d A+i z A = z A (IV) 
di,...,d A > 
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It follows from a straightforward induction that any f.s. d = (di, . . . , d^+i) for D z ' z ,y (should 
such a f.s. exist) satisfies (I), (II), and (A) iff 

(35) d, = /^^-^(dATTi+dA+izf-z, 1 ) j 6 {1, . . . , A - 1}; 

I i=0 

which implies that for j € {2, . . . , A — 2}, 

(36) (dj + i - dj) - (dj - dj-i) = dA7Tj + cIa+iz] - zj. 

We now prove several preliminary results w.r.t. various feasible solutions to the above l.p.s. 

Lemma 10. There exists a feasible solution for the I. p. D~P 'P + ~ 1 with value —rf'. 

Proof. The proof is deferred to the appendix. □ 

Lemma 11. There exists a feasible solution for the I. p. DP P 1-1 with value rfj. 

Proof. The proof is deferred to the appendix. □ 

With Lemmas 10 and 11 in hand, we now complete the proofs of Corollary 2 and Theorem 7. 

PROOF of Corollary 2. It follows from Lemma 11 that the l.p. DP°'P + '° is feasible and has 
optimal value zero. Thus strong duality implies that the l.p. PP P ,0 has optimal value (note that 
is always feasible). Combining with Theorem 6, we conclude that any c.p.u. p s.t. p+ = must 
also satisfy po < 0, pa-i < 0. But since Theorem 5 ensures that for any c.p.u. p, {A^~ k Pk,k = 
0, . . . , A — 1} is convex, it follows that po < 0, pa-i < implies Pfc < for all k £ {0, . . . , A — 1}. 
Combining with the characterization for p + given in Theorem 5 ensures that p/j = for all k € 
{0, . . . , A — 1}, since if any of the p^ was strictly negative, then p + would be strictly positive. The 
characterization for pa given in Theorem 5 ensures that pa also equals zero in this case, completing 
the proof. □ 

PROOF of Theorem 7. That c 1 is convex is trivial; that 7r • c 7 < follows from Corollary 3. 
It also follows from Corollary 3 that 



p+.c' = 2A 2 (A-2)- 1 A A 1 ^p^ 



i=0 

and 



p -c' = 2A 2 (A-2)- 1 A A 1 ^p° = rt 



4 = 

Combining with Lemma 10 and strong duality completes the proof of the first part of the theorem. 
We now show that c 11 is a c.d.u., and begin by showing that c 11 is convex. Note that c] 1 — Cq 1 = 

— — A^ 1 — A, while for all k E {2, . . . , A}, cj, 1 — cj/_i = — — A^ 1 , implying convexity. It follows from 
Corollary 3 that 

A A 



7T 



i=0 i=0 
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P 



-i 



i=0 



i=0 



A 




.7-/ 



E^°)+ A <°; 




i=0 



i=0 



Combining with Lemma 11 and strong duality completes the proof of the second part of the theorem. 



8. Conclusion. In this paper, we used higher order M.r.f. to study i.s. in regular graphs of 
large girth. To our knowledge, this is the first such explicit analysis for i.s. We gave sufficient condi- 
tions for a second-order h.i. M.r.f. to admit a unique infinite-volume Gibbs measure on the infinite 
A-regular tree, and gave both necessary and sufficient conditions when the relevant clique poten- 
tials of the corresponding Gibbs measure satisfy a log-convexity assumption. We then developed a 
perturbative approach to gain further insight into the geometry of this (A + l)-dimensional space 
(i.e. uniqueness regime). Interpreting our own model as a (A + l)-dimensional perturbation of the 
hardcore model at the critical activity, we showed that under a convexity assumption, the set of all 
such perturbations which preserve uniqueness is a polyhedron with a simple description. We also 
gave an explicit polyhedral description for how one can change the occupancy probabilities through 
such a perturbation. We then used l.p. to analyze the properties of these polyhedra, showing that 
although one cannot acheive denser independent sets, it is possible to optimize the number of ex- 
cluded nodes which are adjacent to no included nodes. 

This work leaves many interesting directions for future research. The full power of higher order 
M.r.f. for sampling from i.s. in sparse graphs, and the associated uniqueness regime, remains poorly 
understood. Several questions build immediately on the models considered in this paper, e.g. what 
occupation probabilities can be attained by second-order h.i. M.r.f. with log-convex clique potentials 
in the uniqueness regime, outside of the neighborhood of the critical activity (i.e. the setting we 
analyze in depth), or the setting of log-concave clique potentials (which includes the restriction 
to maximal i.s.), where the relevant recursions also simplify. More generally, can one user higher 
order M.r.f. (in the uniqueness regime) to sample from denser i.s. than can be attained using the 
hardcore model at the critical activity? It would also be interesting to study higher order M.r.f. for 
related combinatorial problems, e.g. graph coloring. 

The algorithmic implications of phase transitions for higher order M.r.f. also remain open ques- 
tions. In particular, one would expect a "complexity transition" at the uniqueness threshold w.r.t. 
approximately computing the relevant p.f. and sampling from the associated distributions, as has 
been recently established for first order M.r.f. [121], [106], [107], [43], [3], [105], [78], [19], [124]. It would 
also be interesting to investigate the possibility of converting our perturbative analysis into a "gra- 
dient descent" algorithm for finding an "optimal" Gibbs measure, subject to a "uniqueness regime 
constraint". Another possible direction would be to investigate the connections between our work 
and the recent work on computing p.f. using belief propagation and related message-passing algo- 
rithms [21], [74], [104]. 

Finally, it is open to investigate the connection between higher order h.i. M.r.f. and recent research 
on sparse graph limits [80]. This includes work studying notions of convergence in large random 
graphs [5], [44], as well as other notions of convergent graph sequences [7], [13], [11], [59], [31], [32], [85]. 
It would also be interesting to study the relationship between higher order h.i. M.r.f. and the re- 
lated notions of local algorithm and i.i.d. factor [48], [46], [47], [45], [81], [38], [12], [26], [59], in which 



The final part of the theorem follows from the fact that 
the proof. 




completing 



□ 
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one samples from the i.s. of a graph by assigning nodes random weights, and using these weights 
to select nodes for inclusion in a distributed, localized manner. In particular, to what extent are 
higher order h.i.M.r.f. and their associated Gibbs measures capable of (approximately) encoding 
the relevant distributions on i.s., especially if one generalizes to conditional M.r.f. and M.r.f. on 
larger label sets? 

Acknowledgements. The author would like to thank Kavita Ramanan for several stimulating 
discussions. 

9. Appendix. 

9.1. Proof of Theorem 9. In this subsection we complete the proof of Theorem 9. We begin by 
deriving the relevant Taylor expansions for several other functions. We first prove that 

Lemma 12. For I G {0, 1} ; fi : h( x ) is a jointly real analytic function of h,x at (0,xa), where it 



Proof. The real analyticity follows from the fact that fi t h( x ) is a polynomial in h,x. Note that 



oo 



oo 




i=0 j=0 



A-1 




fl,h( X ) 



i=0 




A-1 



(37) 




A-1 



(38) 



We first analyze (37), which by an interchange in the order of summation equals 




Since 





it follows that (37) equals 



A-1 



(39) 




j=0 
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We now analyze (38), which by an interchange in the order of summation equals 

j=o ^ i=j \ 1 y v/ / j =0 

Combining (37) - (38) with (39) and (40) completes the proof. □ 

To proceed, we will need to compute similar expansions for various composite functions. We will 
rely on the following two facts about power series. The first, about composition of power series, 
follows from the multivariate Faa di Bruno formula [25]. 

Observation 6. For two functions cth(x), fih{x) s.t. (5h{x) is a jointly real analytic function of 
h,x at (0,xq) and a^(x) is a jointly real analytic function of h,x at (0, (3q (xq)) > with associated 

coefficients {c a ^j,i,j,£ Z + } and {c/3,jj,i, j, £ Z + }, the composite function r]h(x) = ah(/3h{x)) is 
itself a jointly real analytic function ofh,x at (0,xq), with associated coefficients {c V! ij,i, j, € Z + } 
satisfying 



i i+j ji 

C„ . v 

l 2 ^2 



°V,i,j - S jfiCa,i,0 + E E Ca ^l E II C (3 i< fc) j ( 2 k) ■ 

11=031=1 (4 1) ,4 1) )--.(4 Jl) ^ l) )e(^x^\(o,o)) ilfe = 1 



yii ,-(*) - ,- 



Specifically, 



C V,0,0 — Cq,0,0j 
Ctj,0,1 = Ca,0,l c /3,0,1' 

i 2 
C^,0,2 — Cqi,0,1 c /8,0,2 + CqjO^C^cIj 

3 

c r;,0,3 = C a ,0,l c /3,0,3 + 2c Qi o,2C ( 9,0,l c /3,0,2 + c a,0,3Cg,0,l > 
Cr/,1,0 = Cq.,1,0 + C a fi t iCf3 : ifl, 

£77,1,1 = c a,0,l c /3,l,l + 2c a , ! o,2C | g ) i j oC<3 ) o,l + C a ,i,iCg,o,l, 

2 

c r;,2,0 = C a ,2,0 + C a ,0,lC/3,2,0 + c a,0,2Cp,lfl + C a ,l,lC/S,l,0. 

The second, about multiplication of power series, is that 

Observation 7. For two functions ah(x), (3h(x), both of which are jointly real analytic func- 
tions of h,x at (0, xo) with associated coefficients {c a! ij,i, j, G Z + } and {c^ij,i,j,G Z + }, the 

product r]h(x) = ah{x)(3h(x) is itself a jointly real analytic function of h,x at (0, xq), with associ- 
ated coefficients {c V: ij,i,j,G Z + } satisfying 

i i 

c V,iJ = c <x,h,ji c /3,i— h,j-ji- 

i 1= ji=0 
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Specifically, 

Cjj,0,0 = Ca,0,0C[3,0fi, 

C V ,0,1 = C a ,0,0C/3,0,l + C a ,0,lC/3,0,0) 

Cjj,0,2 = ^,0,0^,0,2 + Co,0,lC/3,0,l + Ca,0,2C/3,0,0j 

c r/,0,3 = Ca,0fiC(3,0,3 + Ca.O.l c /3,0,2 + Co,0,2C/3,0,l + Co,0,3C/3,0,0> 

Cjj,1,0 = Cq-,0,0C/3,1,0 + 00,1,0^,0,0; 

Cjj,l,l = Co,0,0C/3,l,l + 00,0,10/3,1,0 + 00,1,00/3,0,1 + Co,l,lC/3, ,0, 
0,7,2,0 = 00,0,00/3,2,0 + c o, 1,00/3,1,0 + Co,2,OC/3,0,0- 

We now use Observations 6 - 7 to derive expansions for several functions. In particular, 
Corollary 4. Fori G {0,1}, fj~h(x) is a jointly real analytic function ofh,x at (0, xa)j where 

oo oo 

if /ias f/te expansion ff^ix) = Cy-i i jh' l (x — x^Y ■ Furthermore, 



n 

i=0 j=0 



A(A-2) (A + l)A(A-2) 2 



c /f\o,o - ( A 2 ) 1 ' c /f\o,i - 1 > c / r \o,2 - 2 (A-1) ' ^r 1 ' - 3 ~ 6(A-1) 2 

2 

1 in /a i Cr — 1 -i -i . „ / ■ -v\ o i Cr — 1 



"/j > 1 ' (A -2) 2 ' /«" A-2 (A -2) 2 ' Vf 1 AO (A-2) 3 ' 

Proof. Let a h (x) = c^ (l - x)" 1 , /3 h (x) = -cj* 0fi (fi jh (x) ~ c/j,o,o)> and note that ff^{x) 
a h (f3 h (x)). Let 



o, 



A fc^ 0(Q i = 0,j > 0, 
otherwise; 



l- c ^o,o c /i« otherwise. 

Since f3h(x) is a jointly real analytic function of h, x at (0, xa) with associated coefficients {c^jj, i, j E 
Z + }, /3o(xa) = 0, and cth(x) is a jointly real analytic function of h, x at (0, 0) with associated coeffi- 
cients {c a ,ij,i,j G Z + }, it follows from Observation 6 that f[~^(x) is a jointly real analytic function 

oo oo 

of h,x at (0,xa), where it has the expansion ff^(x) = X^X^ C / _1 ij^ % ^ x ~ ■ Furthermore, 

i=0 j=0 

c f-\o,o = c / ; !o,o = ( A - 2 ) x ; 



c f-\o,i - c /i!o,o( c / ; !o,o c / ; ,o,i) 

- ^-'(-^{^f (V)) - - 
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C f-\0,2 ~ C hfiA C /(A0 C /i-0,2) + c /^ ,o( C f l fi,0 C flfiAY 

\ _ CM" 1 I - (A - 9W 

(A-l)iV 1 



- ( .- 2rI (- (A - 2) -^(--)) + ( .- 2rI (- (A - 2) -^(- 



c ,-1 



A(A-2)_ 
2(A- 1) ' 



- ^r'(-(A- 2 )-^(V)) 

+(A - 2)-' ( - (A - 2)"' < A - 2 » 2 ^ - - (A + DA(A - 2)= 



(A-l)'V 1 77 6(A-1) 2 



/,-M.o - ^U-'A^) = (A-^-tA-^-^c) = -^ffp; 



c /f',i,i = c /iko( _c /,ko c /i-i.i) + 2c a!»^(- c /,ao c /i,i,o)(-c / ,J w) c/ 1 ,o,i) 
= (A-2)- 1 (-(A-2)- 1 c /lJ , 1 ) 

^-^■(-(A-^W^-tA-^lff^- 1 ) 
_ 2 C/ii i )0 c/,,1,1 



A- 2 (A -2) 2 ' 



r 2 



~ c /,Ao( -c /iAO c /j.2,o) + c /,i 1 o( -c /,i,o c /i.i,o) 2 

= (A-2)- 1 (-(A-2)- 1 .o)+(A-2)- 1 (-(A-2)- 1 c /(>1 , ) 2 = (A _ 2)3 - 

□ 

For i > 0, let X i = (A - 2)- 1 (c /liM - c /o , M ), and £ = Xl - (A - 2) X o- Then 

Corollary 5. f^ 1 (x) is a jointly real analytic function of h,x at (0, xa), where it has the 

oo oo 

expansion f^ l {x) = y~] y~] Cf-i i: jh l (x — ia) j . Furthermore, 
i=o j=0 

c /-i,0,0 = 1 . Cf-M.l = , = , C f -l A3 = , Cj-i^Q = -xo, 

t _ XoCf 1: i,o 

C /-M,i - ~? ' c /- 1 ,2,o - A _ 2 • 
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Proof. Let a h (x) = f ,h(x), /3 h (x) = f lt l(x), and note that f h 1 (x) = a h (x)/3 h (x). Let c a>i j = 

c fo,i,ji an d c P,i,j = c / _1 ij- Since cth(x) is a jointly real analytic function of h,x at (0,xa) with 
associated coefficients {c a: ij,i,j £ and/3/j(x) is a jointly real analytic function of h, x at (0,£a) 
with associated coefficients {cgij,i, j € it follows from Observation 7 that f7~ (x) ls a jointly 



oo oo 



real analytic function of h, x at (0,xa), where it has the expansion f^ 1 (x) = S ^^Cj-i i jh l {x 

i=o j=0 

X/\) J . Furthermore, 

.(A-2) 1 /A-lV,, A . 
c/-i,o,o = c /oi0 ,oc /r i AQ = ( (A _ 1)0 (^ Q J) ((A -2) ) = 1; 



C f-\0,1 ~ C fo,0,0 c f-\ ,1 + C /o,0,l c /- 1 ,0,0 



(A- 2)V A-m, l)+ (gz%(* ))(A _ 2) -> = o; 



(a - i)o v o 77 v ' V(a- 1) 1 V 1 



c f-\0,2 = ^0,0,0^-^0^ + C/ ,0,l c /- 1 ,o,l + C /0'°' 2C /r 1 ,0,0 

' (A-2)Y A-l\\/ A(A-2) \ / (A-2) 2 /A-l\\ 

.(A-i)oV o v 2(a-i); + V(a-i)iV 1 )r ' 



c /-\0,3 = C /o,0,0C /r i A3 + C/oAlC/f 1 ^^ + C /oA2C /r i A i + Cf 0j0 ,3C f -l fifi 

AA-2) 1 /A - 1\ \ / _ (A + l)A(A-2) 2 \ / (A-2) 2 /A-l\\ / A(A-2) \ 
V(A-l)°l ;A 6(A-1) 2 J + \(A-iy\ 1 JJ\2(A-1)J 

(A-2) 3 fA-l\\ r iN t f(A-2) 4 fA-l\\ rA t 



c f-\i,o - c fo,0,OCf-\i l0 + c fo,i,o c f-\ ,o 

'(A-2) 1 /A-l\\ / C/li i, 



(A-1)°V 



c /-\i,i - c fofi,o c f-\i,i + c hfl,i c f-\i,o + c h,ifi c f-\o,i + c hXi c f-\^ 

(A-2) 1 (A - 1\\ { 2c ful>0 c fuhl \ ( {A-2f (A - 1\\ ( _ C/l|1|0 



(A-1)°V 77 V A-2 (A-2) 2 7 \(A-iy\ 1 77 V (A-2) s 
+c /o ,i,o(-l) + c /o ,i,i(A-2)- 1 = 
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c /-\2,0 = c /o,0,O c /- 1 ,2,Q + C /o 1 l,O c /- 1 ,l 1 + C /o,2,OC / -l 0iQ 

( (A-2) 1 fA-l\\f 4 10 \ / C/ \ XoC/ 

□ 

Corollary 6. f h ^ (x) is a jointly real analytic function ofh,x at (0, xa), where it has the 

OO oo 

expansion f h ^ A ^(x) = c^-(a-i) ijh l (x — xa) j . Furthermore, 

i=o j=0 

Cj-(A-i) 00 = 1 , Cy-(A-i) j0jl = , Cy_(A-l) j0)2 = , Cj_(A-l) j0 3 = , Cy_(A-l) )1)0 = -(A— 1)xoj 

/ a 1U (A-l)xo , A 
c /-( A -i),i,i = -(A-l)i; ) c /-(A-i) )2 ,o = 2 ' a - 2 C /i. 1 <° ~ c /o,i,o)- 

PROOF. Let ath(x) = x A_1 , /^(x) = f^ l (x), and note that /^^^(x) = a>h(/3h(x)). Let 

'A-r 



, i = 0,jG{0,...,A-l} 

J 

otherwise; 



and c^jj = Cj-i jj-. Since @h(x) is a jointly real analytic function of /i, x at (0,xa) with associated 
coefficients {cp ; ij,i,j G /3o(xa) = 1, and a/j(x) is a jointly real analytic function of h, x at 

(0, 1) with associated coefficients {c a> i t j,i,j G Z + }, it follows from Observation 6 that /. 0*0 
is a jointly real analytic function of h,x at (0, xa), where it has the expansion /, ^ ^(x) = 

oo oo 

Cj-(a-i) ijh\x — xa) j - Furthermore, 

i=0 j=o 



Cy-(A-l) j0)0 



A - f 




c /-(a-d,o,i = [ A 1 jCf-i,o,i = y i J(0) = 0; 



A - A /A — 1\ 2 /A - 1\ / A - 1 



c /-(—),o,2 = ( l «/-A2+( 2 )cJ-x A1 = I ! )(0)+( 2 )(0) = 0; 



A-l\ /A-l\ /A-U o 

j )c/-i A3 + 2[ 2 )c / -i A1 c / -i )0)2 + I 3 l^-i^ 

V)(o> + a(V)(o)<o) + (V)(o) = <* 
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c f - iA - 1)10 = o + f A 1 ^ c f -i jl;0 = -(A-l)xo; 

c /-(A-i>,l,l = ^/-M,! +2 ( A 2 ^/-M^/^Ai + Wc/^Ai 

= (A-l)(-0 + 2f A " 1 V- X o)(0) = -(A-l)C; 



/A - 1\ /A - 1\ 2 

Cj-ca-d^^ = 0+1 1 1 c/-i )2) o + I 2 ) c /-i,i,o + (°) c /-M,o 

/ A n /xoc/^o\ 1 2 (A - l)xo , A 
= (A-l)(^-^-j+-(A-l)(A-2)x = (_ CAil|0 _ cyb4f0 ). 

□ 

Corollary 7. Ph( x ) is a jointly real analytic function of h,x at (0, xa), where it has the 

oo oo 

expansion Ph(x) = c p ^jh l {x — sa) 3 '- Furthermore, 

( a \-l -, n n (A-l)xo 
Cp,0,0 — — > Cp,0,l — 1 ) c p,0,2 — U , C Pj o,3 — U , Cp,l,Q — ^ — 2 ' 

(A-l)xi (A-l)xo , A 

c p,i,i A _ 2 ) c Pi 2,o - _ 2 ) ^ A - 2 C/l,1 '° ~ C/o ' 1 '°' ) ' 

PROOF. Let (^(x) = x, /3h(x) = / ft ^ A 1 ^(x), and note that Ph(x) = a/ l (x)/3/ l (x). Let 

(A-2)- 1 i = 0,j = 0; 
1 i = 0,j = 1; 

^ otherwise 

and cg^j = Cf-(A-t)ij- Since a/ l (x) is a jointly real analytic function of h,x at (0,xa) with 
associated coefficients {c a! ij,i, j € Z + }, and /^(x) is a jointly real analytic function of h, x at (0, xa) 
with associated coefficients {cp ij,i,j £ Z + }, it follows from Observation 7 that Ph{x) is a jointly 

oo oo 

real analytic function of h, x at (0, xa), where it has the expansion ph{x) = c p ^jh % {x — xa) j - 

i=0 j=0 

Furthermore, 

c p ,o,o = (A-2)- 1 c / _ (A _ 1)A0 = (A-2)" 1 (l) = (A-2)" 1 ; 



c pA i = (A-2)- 1 C/ - ( A-i) A1 + (f)c / _(A^) A0 = (A -2)~ 1 (0) + (f)(1) = 1; 
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c P ,o,2 = (A- 2 ) 1c /-(a-i),o,2 + (^/-(a-ii.o.i + (°) c /-(^-i),o,o 
= (A-^O + aXO) = 0; 



Cp,0,3 = (A - 2) 'cj-IA-lj^j + (l)Cy-(A-l) A2 + (Oj^-lA-lJ^! + (Ojc^-tA-l)^ 

= (A-2)- 1 (0) + (l)(0) = 0; 



c p ,i,o = (A- 2 ) ^/-(a-i^o + (0)c / -(a-i )jO)0 

= (A-2)- 1 (-(A-l) Xo )+(0)(l) = - (A A ~_ 1) 2 X ° ; 

Cp,i,i = (A - 2)- 1 c / _(a-i) i1i1 + (l)c / _(A-i ))li0 + (0)c / - ( A-i) A1 + (0)c / _(a-i ))Oi0 
= (A-2)- 1 (-(A-l)e)+(l)(-(A-l) Xo ) = ~ (A A "-2 X1 ; 

c P ,2,o = (A - 2) 4 c r( A-i) A() I (0)cj-(a-d i1i() I (0)cj- ( a-i) i0i() 

9W /(A-l)xo A \ (A-l)xo , A 

= (A - 2) ^ (^_ c/i|lj0 - C/() 10 ) j = ____(___ C/i 10 _ c/,,,1,0). 

□ 



Corollary 8. gt^( x ) i- s a jointly real analytic function of h,x at (0,xa), where it has the 

oo oo 

expansion g^{x) = '^2'^2 C 9^ ,i,jh l (x — ia) j . Furthermore, 
i=o j=0 

. x A(A-2) A(2A-l)(A-2) 2 

Cg^,0,0 = (A-2) , Cgf-,0,1 = -1 , C g <- j0 ,2 = "^T TT" , Cg^,0,3 ~ 



2(A-1) ' 9 ,u ' d 6(A-1) 2 

c /o,i,o . c /o,i,i c /o,i,o . c /0i i,q(2(A - l)c /o , M - (A - 2) 2 c /0i i,q) 

9 " 1 ' (A-2) 2 ' (A-2) 2 A-l ' 9 " 2 ' 2(A-l)(A-2) 4 

Proof. It follows from the Lagrange Inversion Theorem that g^ix) is a jointly real analytic 
function of h, x at (0, 50 a)) • Since 

• A-l 



go(x A ) = x A 1 ( A k ^A 



k=0 

it follows that g^(x) is a jointly real analytic function of h,x at (0,xa), where the coefficients 
{c g ^- j £ Z + } may be determined from {cy-i ijii>3 £ -^ + } by matching the coefficients in the 
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Taylor series of the composite function g^^g^x)) to those of the identity function, and solving the 
associated s.o.e. Thus letting a>h(x) = g^ (x), (3h(%) = 9h{x) = /oh( x )' ^ follows from Observation 



6 that 



XA 


= c 9 ^,o,o; 






1 




















= Cg^,a,iCf-\ ,3 + 2c ^'0,2C /o -i i01 c /o -i 


i0 ,2 + C 9^,0,3cJ-i 


,0,1' 





= %^,i,o + c s <-,o,ic / -i, 1 , ; 









= Cg^^!C f -l lx + 2c g ^ A 2C / -i 1)0 c / -i 


(0) 1 + Cg^,l,lC f -l 


,0,1' 





= Cg^,2fl + c g^,0,l c f-\ 2 ,0 + c g^fi,2C 2 f - 


1,1,0 + ^,U c / - 


1 ,i,o 


!., we 


find that 







Cg^,0,0 = (A - 2) 



-1. 



c ^' - 1 = c / -\o,i = " 1; 



C 9 -,0,2 - - C 9^,0,l C /-l A 2 C /o -l,o,l 



/ A(A-2) N = A(A-2) , 
1 J V2(A-1) / J 2(A-1) 



C 9 ^,0,3 - - c /o -i, ,i( < V-,0,l c ^- 1 ,0,3 + 2c fl^A2 c /-i,o,l C /-l,0,2 



(2A-l)A(A-2) 
6(A- l) 2 



2 



/ ,\f c /o,l,0 \ 
Cg^.1,0 = -C^,0,lC /o -i A o = -(-1JI - (A -2) 2 ) 



c f o,l,0 \ _ c /o,l,0 . 

(A -2) 2 ' 



c /o,l,l _ c /o,l,0 

(A -2)2 A-l' 
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ty-,2,0 - - fv~ .0,1^,2,0 + C 9+~W C )-\lfl + V-.l.l^-Sl.o) 

, _ , , ( C /o,l,0 \ , f A (A ~ 2 A / _ C/o,l,0 \ 2 , / C/ ,l,l _ C/o.nA / _ c /o,l,0 

\(A-2) 3 y V2(A- 1) / V (A -2)2^ + V(A-2)2 A-lA (A - 2) 2 
c/ ,i,o(2(A - 1)^,!,! - (A - 2) 2 c /0 ,i, ) 
2(A-l)(A-2) 4 

□ 

Recall that k = (A - 2)~ 2 ((A - l)c flt i >0 - Ac /o ,i, ), and k x = (A - 2)~ 2 ((A - l)c /l5 i,i - (A - 
2)^,1,!). Also, let k 2 = -(A - 2)- 3 ((A - l)c/ 0jl ,i + (A - 2)c/ 0)1 , ). Then 

Corollary 9. qh(x) is a jointly real analytic function of h,x at (0, xa), where it has the 

oo oo 

expansion qn{x) = c qi i i jh l {x — xa) j - Furthermore, 

i=o j=0 

! A(A-2) (2A - 1)A(A - 2) 2 

Cg,0,0 - (A -2) , C g , ,l - -1 , Cq, ,2 - ^ _ , C g , ,3 _ ^ ' 

A 2 - 2A + 2 A 

Cg,l,0 - K , C g ,i,i - _ _ 2 , C/ ,i, - — _ - Cft^o + K1 , C g , 2 ,0 - CgV- (2) + X0«2- 

Proof. Let ah(x) = (x) , ^(x) = Ph(x), and note that (^(x) = ah(Ph(%))- Let c a ,j,j = 

Cg^,i,j, c P,i,j = c p,i,j- Since f3h(x) is a jointly real analytic function of h, x at (0, xa) with associated 
coefficients {cp t i t j,i,j 6 ^o(^a) = xa, and a/^x) is a jointly real analytic function of h, x at 

(0, xa) with associated coefficients {c ai ij,i,j G Z + }, it follows from Observation 6 that qh(x) is a 

oo oo 

jointly real analytic function of h, x at (0, xa), where it has the expansion qh(x) = c q ^jh % (x— 
xa) j - Furthermore, 



Cg,0,0 — c g^,o,o — (A — 2) 



-l. 



Cg,0,l = ,0,1^,0,1 = (-1)(1) 



i 2 
Cg,0,2 — C ff ^,0,lCp,0,2 + C s ^,0,2C p ,o,l 



V 2(A-1) J y ' 2(A-1) 



Cg,0,3 — C s ^,0,lCp,0,3 + ^Cg^ ,0,2Cp,0,lC p ,0,2 + Cg^fi^C Q 1 



<-^+<^>>M- %rr >2 >> 3 - - A( VV ): 
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c q,l,0 — c g^,lfi + C 9^~ >0, lC 2?.l.Q 

/ c/ ,i,o \ f (A-l)xtA 



c g,l,l — Cg<-,0,lCp,l,l + 2c s <- i o,2Cp,l,O c p,0,l + c g^,l,l c p,0,l 



A 2 - 2A + 2 A 
(A-1)(A-2) C/0 ' 1 '° " A^**' 1 ' + Ki; 



2 

c g,2,0 = c g^,2,0 + c g^,0,l c P,2,0 + c g^ fi^pAfi C S <_ > 1 . lC P. 1 .0 



/->■ 

- c^, 2i o + (-1) (^ 2(A _ 2) Ca^^' 1 ' " C/o ' 1 '° ) J + U(A-l) 
I ( c /o44 c/o,i,o V (A-l)xo N _ 

With Corollaries 4 - 9 in hand, we now complete the proof of Theorem 9. 



A 



2 



□ 



Proof of Theorem 9. Let a h (x) = q h (x),/3 h (x) = qu{x). Let c a ^j = Cq,ij,cp t ij = c qji j. Since 
(3h( x ) is a jointly real analytic function of h,x at (0, xa) with associated coefficients {cp t ij,i,j 6 
Po(xa) = ^A, and och(x) is a jointly real analytic function of h,x at (0, ^a) with associated 
coefficients {c a> ij,i, j € Z + }, it follows from Observation 6 that Qh{x) = qh{qh{%)) is a jointly real 

oo oo 

analytic function of h,x at (0, xa), where it has the expansion Qh{x) = '^2'^2cQ i ijh l (x — xa)° ■ 

i=o j=0 

Furthermore, 

cq,o,o = Cg i0 ,o = (A-2) -1 ; 

cq,o,i = Cg j0 ,ic 9! o,i = (-1)(-1) = 1; 



2 

CQ, 0,2 — Cg 5 0,lCq,0,2 + c g,0,2Cg )0j l 



1 ^2(A-l); + l,2(A-l)^ ' 

CQ, 0,3 = C gi o,lCq,0,3 + 2c gi o,2C g ,0,lC (? ,0,2 + ^,0,3^,0,1 



/ (2A-l)A(A-2) 2 ^ , J A(A-2) \, ,^ A(A-2) \ ( (2A - 1)A(A - 
1 J V 6(A-1) 2 ; +2 V2(A-l)^ j V2(A-l)y' + V 6(A-1) 2 



A(A-2) 3 
6(A - l) 2 ' 
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CQ, 1,0 = Cg,i, + 0^,0,1^,1,0 = K + (— l)«o = 0; 



C Q,1,1 = C g ,o,lC g ,i,i + 2c g ,o,2Cg,l,oC g ,0,l + C g ,l,lC g ,o,l 
= 2c,j,i,iCq,o,l + 2c g ,o,2C g ,l,oCg,0,l 

= -2(^,1,1 + ^,0,2^,1,0) 
A-2 A 



C Q,2,0 — c q,2,0 + Cg,0,lC 9 ,2,0 + c g,0,2 c ,j,l,0 + Cq,l,lCq,l,0 

= Cg, 2 ,0 + (-l)Cg,2,0 + C g ,l,o(c g ,0,2C g ,l,0 + C<?,l,l) 



«o(Cg,l,l + C g ,0,2Cg,l,o) 



-CQ,l,lK . 



Since r/ l (x) = Qh{x) — x, it follows that r^ix) is also a jointly real analytic function of h,x at 

00 00 

(0,33a), where it has the expansion r^(x) = c r< i t jh l (x — xa) j - Furthermore, 

i=o j=o 

Cr,0,0 = Cq,o,0 — ^A = 0, 
Cr,0,l = C Q,0,1 — 1 = 0, 

and Cj-ij = CQjj for all other i,j G Z + . Differentiating rh(x), we find that r' h (x) is also a jointly 

00 00 

real analytic function of h, x at (0, xa)> where it has the expansion r' h (x) = c r i ^jh l (x — x^Y ■ 

i=o j=o 

Furthermore, we find that <y » j = (j + ljc^j+i for all i,j, G Z + , completing the proof. □ 
9.2. Proof of Lemma 6. 

Proof of Lemma 6. LetG /l (x,y) = {x-g h (y)f£~ 1 (x)) 2 + (y-g h (x)ff ^(y)) 2 , and note that 
satisifes the s.o.e. (6) - (7) iff Gh(x,y) = 0. Let S h denote the set of all (x, y) G M + 2 s.t. 
Gh(x,y) = 0. It follows from (9) - (10) that there exist e\,M\ € (0,oo), depending only on c (but 
not on h), s.t. 



(41) 

Using the identity a 2 
at most 



h G [0, ei], (x, y) G S h implies Mf 1 < x, y < M x . 
b 2 = (a + 6) (a — b), we find that for any x, y G M + 



l^foy) - G (x,y)| 



JS 



(42) 
(43) 



2x-g h {y)f^\x)-g {y)ft\x) 



+ 



2y-^(*)/My)-So(*)/r i (y) 



A-l, 



9h(y)ft 1 (x)-9o(y)f A - 1 (x) 
9h(x)ftHy)-9o(x)ft\y) 



It follows from (9) - (10) and (41) that there exist 62, M2 G (0, 00), depending only on c, s.t. 
(44) 

h G [0, €2], (x, y) G S h implies 



2x-g h (y)f^- 1 (x)-g (y)f^- i (x) 
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2y-g h (x)ft\y)-9o{x)f£-\y) 



A-l, 



< M 2 . 



Using the identity ab — cd = -((a — c)(b + d) + (6 — d)(a + c)), we find that for any x,y £ 



A-l, 



is at most 



(45) 



and 



(46) 



A-l, 



9h(y) -go(y) 



9h{y)+go{y) 



A-l, 



g h (x)ft\y)-9o(x)ft\y) 



A-l, 



is at most 



A-l, 



5ft 0*0 - 9o(x) 



+ 



9kW + 5o0*0 



A-l, 



It may be easily verified, using (9) - (10), that for any e > 0, there exists S t £ (0, oo), depending 
only on c and e, s.t. 



(47) 



max 



9h(x) - go(x) 



< e, max 

7ie[o,<5 £ ] 

se[Mf 1 ,Af 1 ] 



A-l, 



< e. 



Using (44) to bound the first terms of both (42) and (43), and (9) - (10) and (45) - (47) to bound 
the second terms of both (42) and (43), we conclude that for all e > 0, there exists 5' e £ (0, oo), 
depending only on c and e, s.t. 



(48) 

For e > 0, let 



h £ [0,5' e ],(x,y) £ S h implies |G (x,y)| < e. 



S e = l(x,y) £ [M 1 1 ,Mi] 2 s.t. \x - x A \ > e or \y - xa\ > el, 



and note that S e depends only on c and e. Since (x, y) = x&l is the unique solution to Go(x, y) = 
over the compact set [M-^Mi] 2 , and Go is continuous on [ikfj -1 , Mi] 2 , it follows that for all 
sufficiently small strictly positive e, [M 1 _1 ,Mi] 2 \ S e is non-empty, and 



(49) 



min Gq(x, y) > 0. 

(x, V )e[Mf 1 ,Af 1 ] a \Se 



Combining (48) and (49) completes the proof. 
9.3. Proof of Lemma 7. 
Proof of Lemma 7. Note that 

A A 



□ 



i=0 i=0 v 7 

i=0 



A 



1 sA-i 

A- V ' 



Let -B = B(n,p) denote a binomially distributed r.v. with parameters n,p. It is well-known that 
E[B] =np,E[B 2 ] = np(l — p + np), and E[B 3 ] = np(l — 3p + 3np + 2p 2 — 3np 2 + n 2 p 2 ). Combining 
the above, and simplifying all relevant expressions, completes the proof. □ 
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9.4. Proof of Corollary 3. 

Proof of Corollary 3. By repeatedly applying Lemma 7 and simplifying, we compute 
AAA A 

J^tt, = (A-2)^A Aii + (6-5A)^iA A>i + 2(A-l)^i 2 A Aii 

4=0 i=0 i=0 i=0 

{ A-2> ' 



A A A A 

J^TTii = (A-2)^iAA,i + (6-5A)^i 2 AA,i + 2(A-l)^i 3 A A , 

i=0 i=0 i=0 i=0 

v A-2 ; 



A 



Y^Pi = i> fe (4-(A;-l)A 2 )^A A , i + i > fc ((A 2 -2A + 2)A ; -A 2 )^iA A , ? ; + A- 1 ^A A , i( 5 ; 

i=0 

(2A 2 )- 1 A A , fc (A + 2- A A;); 



4=0 i=0 i=0 i=0 

,2\-l 



A A A A 

pfi = H (4 - (k - 1) A 2 ) i^A,i + i>fc((A 2 - 2A + 2) fc - A 2 ) ^ i 2 A A ,i + A" 1 ^ i A Aji ,5 life 

i=0 i=0 i=0 i=0 

= (2A)- 1 (A:A-(A + 2))A Aifc ; 



A A + ? A A 

Epi = -^E^ + (2A)- 1 ^ i A A , 

i=0 i=0 i=0 

= -(2A 2 )- 1 (|^) A - 1 ; 



XX* = -^e^a,4 + (2a ) - i x:^a a , j 

4=0 1=0 i=0 



A /A A N 

Ep4 fc = A A , i (2A 2 (A-2)A A )" 1 (4 + A 2 -A 2 i)EA A , fc + ((A 2 -2A + 2)f-A 2 )EAA,^ 



fc=0 V k=0 k=0 



+A ^j^iy 

fc=0 

A zV (2A 2 )- 1 (A + 2- Ai); 
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A /A A \ 

kp- = A AlJ (2A 2 (A - 2)A A ) -1 (4 + A 2 - A 2 *) ^ A Ajk k + ((A 2 - 2A + 2)i - A 2 ) ^ A Aife A: 2 

k=0 V fc=0 fc=0 / 



A 



+A- 1 A Aii ^M i . 

A:=0 

= A A , i (2A)- 1 (Ai-(A + 2)). 

□ 

9.5. Proof of Lemma 8. 



Proof of Lemma 8. We first prove that \ T ■ pj = Uk,i for k G {1, . . . , A - 2}, and i G 
{0, . . . , A}. Indeed, let us fix some k G {1, . . . , A — 2}, and i G {0, . . . , A}. Then 



T • = ( - AA^JAa,* ^fc-i (4 - (k - 2)A 2 + ((A 2 - 2A + 2)(k - 1) - A 2 )^ + A" 1 ^ 

+ (2AA A y A A ,< (4 - (k - 1)A 2 + ((A 2 - 2A + 2)k - A 2 )^ + A" 1 ^ 
+ ( - AA A | fe+1 ) A A ,i ^fc+i (4 - fcA 2 + ((A 2 - 2A + 2){k + 1) - A 2 )^ + A^Si^ . 
Since AA^^i>j = (2A(A — 2)A A ) 1 , it then follows from a straightforward calculation that 



T ■ p I = — + 2(5^ — c^fc+i = Z4,ii 

/ k,i 

completing the proof. 

We now treat the cases k = A — 1 and k = A. Let us fix some i G {0, . . . , A}. Then { T ■ p 
equals 



A-l,i 



Q(A - 2) A (A + A Aii ^(4 - (j - 1)A 2 + ((A 2 - 2A + 2)j - A 2 )i) + 

+ ( ^j^ 2 (A 4 -6A 3 +13A 2 -12A+2)^) A AjJ ^> A _ 2 (V(A-3)A 2 +((A 2 -2A+2)(A-2)-A 2 )^ +A- 1 ^, A _ 2 ^ 
+ Q(A-2) A -H2A 2 -5A+6)^A A , i ^> A _ 1 (V^ 

= A ^ J^ a ~_ 2 ^a-i ( 2 ( A " !) A_1 ( E ( A + J)«W " (A + 0) + 2(A A ~- 2^ ( 2A2 " 5A + 6 )^-A-i 

(50) + 4(A ~_ 1 ^ 2 " 2 (A 4 - 6A 3 + 13A 2 - 12A + 2)tf i)A _ 2 j , 
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and ( T ■ p ) equals 

A,i 



( - A(A - 1)(A -j)(2j + A - 3)) A Ail (u 3 (4 - (j - 1) A 2 + ((A 2 - 2A + 2) j - A 2 ) + A" 1 ^ 

(51) = A Ajl f ( A 3 - 4A 2 + 4A - 2) + ( A 2 - 3A + 3)i - ( A - 1) ^ ( A - j) (2j + A - 3)5 i}j 
\ j=0 

where both (50) and (51) follow from Lemma 3 and a tedious calculation (the details of which we 
omit). The remainder of the proof then follows from a straightforward case analysis. □ 

9.6. Proof of Lemma 9. 

Proof of Lemma 9. Let us fix some j € {0, . . . , A}. It follows from Corollary 3 and a straight- 

1 A_1 

forward calculation that — ^](A + k)pj equals 



2A 

fc=0 



A A ,^(2A 2 (A-1) A ) 1 ^A(A-l) A -A 3 -A+2+(A-l) 2 (A+(A-l) A - 2 )j^-(2A 2 )- 1 J^A+k)6 jik 

Alternatively, p A equals 

A A /(2A 2 (A - 1) A_1 ) -1 ( - (A 2 + A + 2) + A(A - l)j) + A" 1 *,- A 



k=0 



The remainder of the proof then follows from a straightforward case analysis. □ 
9.7. Proof of Lemma 10. 

Proof of Lemma 10. Let us define a vector d = (di, . . . , d A +i) by 



(52) dj 



r/ A J = A + 1; 

j = A; 

i-i 

^(j-i)(d A 7r i + d A+1 p++p°) j €{!,..., A- 1}. 



i=0 



That d satisfies (L), (II), and (A) follows from (35). We now show that d satisfies (III) and (IV). 
It follows from Corollary 3 and a straightforward calculation that 

A — 2 

(53) d A " 



A(A- 1) A - 2 ' 
and 



(54) d A -i = (A (A - 1 
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,A-1\-1 



Combining (53) and (54), it follows that — dA-2 + 2dA-i + T fpX-i + Pa-i ec l ua l s 

" ( A(A A _" 1 )A- 2 ) + 2 ((A(A - I)*" 1 ) - 1 ) + rf ((2A 2 )- 1 A A ^-i ( - (A + 2) + A(A - 1))) 
+ (a a ,a-i(2A 2 )- 1 (^^) a - 1 (4 + A 2 -A 2 (A-1))) = 0, 

verifying that (III) holds, and — <1a i + rf p\ + p% equals 

- ((A(A - I)*- 1 )" 1 ) + rf ((2A 2 )- 1 A a ,a( - (A + 2) + A 2 ) 



+ (a a ,a(2A 2 )- 1 (^-|) a - 1 (4 + A 2 -A 3 )) = 0, 

verifying that (IV) holds. 

We now show that di, . . . , dA > 0, completing the proof. Note that for i € {0, . . . , A — 2}, 
dATTi + d^+ipf + Pi equals 



r? ^A^Aa^-CA + ^ + A*)) + ^Aa ) ,((2A 2 )- 1 (|^) A - 1 
|)A-i (i _ 2 ) + A-XoY 



(4 + A — A 2 i) + A-Mi.o 



\ iA-'(|- 2 



Thus for alH <E {2, . . . , A — 2}, d A 7Tj + dA +i p+ + Pi > 0. It follows from (36) that the sequence 
{dj + i — dj, j = 1, . . . , A — 2} is monotone increasing. However, it follows from (53) and (54) that 



lA-l-dA- 2 = (A(A-1) A - 1 )- 1 



; A(A-1) A - 2 
A 2 - 3A + 1 



A(A - 1 



,A-1 



<0, 



where the final inequality follows from the easily- verified fact that A 2 — 3A + 1 is non-negative for 
all A > 3. Thus dj+i — d < for all j € {1, . . . , A — 2}, or equivalently {dj, j = 1, . . . , A — 1} is 
monotone decreasing. As (54) implies that dA-i > 0, we conclude that di, ... ,dA > 0, completing 
the proof. □ 

9.8. Proof of Lemma 11. Before completing the proof of Lemma 11, we prove an auxiliary 
lemma, demonstrating the non- negativity of certain functions of A. 

Lemma 13. 

2A 3 - 11A 2 + 19A - 11 > for all A > 3. 

/A-2 5A-9 \ ,A-2. A , (\ (A-2) 2 A-2 2A-2\ r 

maX {3A^-^UA^T8)- { A^l ) " - mm U' 2A2-7A + 7 '2^'5A3lJ A > 3. 

A-2 A _, (A-l)(A-2)(A-3) 
l A-l j " 2A3 - 11A2 + 19A - 11 /0 " 
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A — 1 

Proof. It follows from elementary calculus that rj(x) = ( ) x is an increasing function of x 



x 



□ 



on (l,oo). The lemma then follows from a straightforward but tedious computation (the details of 
which we omit). 

We now complete the proof of Lemma 11. 

PROOF of Lemma 11. Let us define a vector d = (di, 



(55) 



(2A 

3-1 



2\-l, 



l-2( 



.A-2, 
A - 1 



A-l\ 



,d A +i) by 

j = A + 1; 
J = A; 



X)CJ - 0(dAir< + d A+1 p+ - p°) j G {1, . . . , A — 1}. 



i=0 



That d satisfies (I), (II), and (A) follows from (35). We now show that d satisfies (III) and (IV). 
It follows from Corollary 3 and a straightforward calculation that for all A > 3, 



(56) 
and 
(57) 



dA-2 = (A(A - 2 



,A-4\-l 



l-( 



d A 



(A(A-2) A - 2 )- 1 (l-( 



A-2 A-1 2A 2 - 7A + 7 
A^T^ (A-2) 2 

A - 2 A _ X 2A - 3 



A-l- 



A-2 



That (III) and (IV) hold then follows from a straightforward but tedious computation (the details 
of which we omit). 

We now show that di, . . . , d A > 0, completing the proof. Note that 



(58) 

(59) 
and 
(60) 



di = for all A > 3, 



d 2 = A-i((^V-^-l) 
vv A-r A-2 ; 



for all A > 3, 



~ 5A-9 A-2 A _ 1 14A-18 
3 2A(A-2) ll A-r 5A-9 



for all A > 4. 



Also, for all A > 7 and i G {5, . . . , A — 2}, d A 7Tj + d A+ ip^ — p° equals 
(61) 



A AiJ (A 2 (A-2))- 1 (a(A-2)+(A-1)((5A-6)(^-^) a - 

It follows from Lemma 13 that for all A > 7 and i > 5, 

-A - 2 -a-ia .2 



■3(A-2))i+(A-l)(A-2)(l-2( 



A-2, 
A-l 



A-2, 



(A - 1)(A - 2)(1 - 2(^- T ) A - 1 )z 2 > 5(A - 1)(A - 2)(l - 2(^- T ) A - 1 )i. 
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Combining with (61) and Lemma 13, we conclude that for all A > 7 and i € {5,... , A — 2}, 
dA^Tj + ^A+ipf — Pi is at least 

Combining with (36), we find that for all A > 7, the sequence {dj + i — d,, i = 4, . . . , A — 2} is 
monotone increasing. However, it follows from (56) and (57) that dA-i — dA-2 equals 

(A - 1)(A - 2)(A - 3) / A - 2 A _! 2A 3 - 11A 2 + 19A - 11 \ 
1 ; A(A-2) A -! VA-1 (A-l)(A-2)(A-3) J ~ ' 

with the final inequality following from Lemma 13. Thus for all A > 7 and i € {4, . . . , A — 2}, 
dj+i — dj < 0, or equivalently {dj, i = 4, . . . , A — 1} is monotone decreasing. As it follows from (57) 
and Lemma 13 that dA-i > for all A > 3, we conclude that for all A > 7, d4, . . . , dA-i > 0. 
Combining with (55) - (60) and Lemma 13 completes the proof. □ 
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